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PROLOGUE 

The  Geostationary  Operational  Environmental  Satellite  (GOES)  data 
collection  system  operated  by  the  National  Earth  Satellite  Service 
(NESS)  of  the  National  Oceanic  and  Atmospheric  Administration  currently 
supports  data  collection  from  several  types  of  data  collection  platforms 
(DCPs).  With  the  advent  of  adaptive  random  reporting  platforms,  several 
potential  procedural  and  technical  problems  associated  with  effective 
network  design  have  been  recognized.  This  project  is  intended  to 
address  these  issues  with  regard  to  the  configuration  of  networks  of 
random  reporting  data  collection  platforms  used  to  record  flood  and 
flood  producing  events.  Chapter  1  gives  a  general  overview  and  problem 
statement.  Chapter  2  overviews  DCP  operation  equipment  and  users 
experiences.  Chapter  3  focusses  on  evaluation  of  the  basic  theory  of 
random  reporting  in  the  telecommunications  field  in  order  to  theoreti¬ 
cally  investigate  channel  performance  characteristics  for  random  modes 
of  operation.  Chapter  4  presents  the  theory  behind  the  proposed  data 
collection  network  design  algorithm.  The  available  climatological  data 
is  discussed  in  Chapter  5  and  Chapter  6  is  a  hypothetical  case  study  to 
illustrate  use.  The  Appendices  contain  user  manuals  and  listings  of 
related  computer  programs. 
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CHAPTER  I 


Introduction 


1.0  Introduction 

The  Corps  of  Engineers  has  selected  GOES  to  serve  as  a  communica¬ 
tions  link  for  the  acquisition  of  hydrometeorological  data  (1).  The 
proposed  Data  Collection  System  (DCS)  will  operate  utilizing  a  network 
of  random  reporting  and  self -timed  data  collection  platforms  to  convey 
river  stage,  precipitation  and  other  data  to  a  central  Command  and  Data 
Acquisition  Station  for  processing  and  dissemination  and  also  to  other 
Ground  Receiving  Stations.  This  effort  is  directed  at  enhancing  exist¬ 
ing  flood  control  and  flood  forecasting  services  and  at  supplementing 
the  hydrometeorological  data  base. 

The  GOES  DCS  consists  of  a  set  of  remote  transmitters,  satellites, 
ground  receive  stations  and  data  processing  and  dissemination  equipment 
(1).  This  system  is  supported  and  regulated  by  NOAA  as  an  integral  part 
of  its  environmental  monitoring  capability.  A  brief  description  of  the 
system  is  presented  below. 

1.1  GOES  Data  Collection  System 

Currently  three  geostationary,  meteorological  satellites  are  in 
equatorial  orbit  at  an  altitude  of  approximately  35,600  km  (2)  over  the 
American  Continents  and  adjacent  oceanic  areas.  These  were  developed 
under  the  Synchronous  Meteorological  Satellite  (SMS)  Program,  and  are 
operated  by  NESS. 
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The  satellite  located  at  75°W  longitude  is  known  as  GOES-East  and 
the  satellite  at  135°W  longitude  is  known  as  GOES-West.  A  partially 
failed  satellite,  i.e.,  a  satellite  with  no  imaging  or  sounding  . 
capability,  is  located  at  107°W  longitude  and  is  known  as  GOES-Central . 
GOES-Central  acts  as  an  operational  standby  for  the  DCS  in  the  event  of 
a  failure  of  either  GOES-East  or  West.  During  the  two  annual  spacecraft 
eclipse  periods,  GOES-Central  is  also  used  to  support  DCS  operations. 

Figure  1.1  illustrates  the  current  configuration  and  areal 
coverage  provided  by  the  two  GOES  satellites  servicing  the  United  States  (2). 

These  have  been  in  operation  for  approximately  7  years  (4).  In 
addition,  planning  is  currently  underway  by  the  World  Meteorologic 
Organization  (WMO)  to  implement  a  satellite  data  collection  network 
capable  of  providing  continuous  global  coverage  of  the  earth's  surface. 

It  is  anticipated  that  geostationary  satellites  will  play  a  major  role 
in  such  a  network.  Similar  spacecraft  are  also  supported  by  Japan  and  the 
European  Space  Agency.  The  USSR  plans  to  eventually  operate  a  similar  space¬ 
craft  over  the  Indian  Ocean  to  complete  the  round-the-world  coverage. 

The  GOES  system  performs  several  meteorological  data  collection 
tasks.  The  satellites  provide  near  continuous  imaging  of  the  earth’s 
surface  and  its  cloud  cover  through  visible  infrared  spin  scan  radio¬ 
meters  (if).  They  also  carry  a  Space  Environment  Monitor  (SEM)  to 
measure  energetic  particle  flux.  X-rays  and  the  earth's  magnetic  field 
and  broadcast  Weather  Facsimile  (WEFAX)  data  (2).  The  GOES  Data 
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Figure  1.1:  Areal  coverage  provided  by  two  GOES  satellites  (2) 


Collection  System  also  serves  as  a  communication  link,  illustrated  by 
Figure  1.2,  for  the  collection  of  environmental  data.  Observations  and 
measurements  of  the  physical,  chemical  or  biological  properties  of  the 
oceans,  rivers,  lakes,  solid  earth  and  atmosphere  are  relayed  through 
the  satellite  system. 

The  GOES  data  collection  system  can  relay  messages  from  environmen¬ 
tal  instrumentation  installed  on  spacecraft,  ships,  buoys,  weather 
balloons,  and  land-based  platforms.  The  system  utilizes  transmission 
frequencies  above  400  MHz  to  minimize  ionospheric  interference  (5).  As 
shown  on  Figure  1.2,  two  sets  of  uplink  and  downlink  frequencies  are 
employed,  the  first  at  2034.9  MHz  (uplink)  and  1694.5  MHz  (downlink)  for 
communications  between  spacecraft  and  large  receiver  systems  and  the 
second  at  401.8  MHz  (uplink)  and  468.8  MHz  (downlink)  is  used  for 
communications  with  remote  low  power  transmitters  (4).  The  401.8  MHz 
uplink  capacity  is  divided  into  200,  1.5  KHz,  channels  in  the  domestic 
or  regional  frequency  band  and  33,  3KHz  channels  in  the  international 
frequency  band  which  permit  low  data  rate,  low  power,  remote 
communication  (4). 

The  primary  ground  receive  station  for  the  GOES  data  collection 
network  is  located  at  Wallops  Station,  Virginia.  Major  components  of 
this  facility  include: 

several  receiving  systems  with  parabolic  dish  antennas, 
ranging  in  size  from  24  to  60  feet  diameter, 
multiplexers  capable  of  supporting  up  to  80  separate  channels 
and  an  automatic  Monitoring  System. 
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a  redundant  disk-supported  computer  system  that  acquires  and 
forwards  received  data  to  the  World  Weather  Building  in 
Maryland.  Up  to  16  hours  of  data  may  be  stored  in  the  event  of 
World  Weather  Building  System  failure, 
triply  redundant  lines  to  the  World  Weather  Building, 
uninterruptable  power  sources, 

a  system  by  which  each  channel  is  tested  at  least  once  per  day 
using  a  test  transmitter, 

The  World  Weather  Building  Facility  contains  the  scheduling 
and  dissemination  computer  system  which  allows  up  to  24  hours 
of  data  storage  and  dissemination  via  direct  or  dial-in 
telephone  at  110/3U0,  1200,  2400  and  4800  baud. 


Several  user  specific,  smaller  installations  are  also- in  operation.  In 
general,  these  installations  contain  a  single  parabolic  dish  antenna  and 
receiver  to  collect  selected  GOES  signals.  Data  management  tasks  are 
handled  by  mini -computers  or  micro-processors. 

Data  collection  platforms  (DCP's)  comprise  an  assemblage  of 
electronic  equipment  for  sensing  physical  conditions,  formatting 
messages,  and  transmitting  these  over  an  assigned  channel.  These 
platforms  are  commercially  available  through  several  manufacturers 
providing  users  with  a  variety  of  sensors  and  data  telemetry 
capabilities.  The  GOES  system  currently  allows  three  primary  reporting 
modes:  1)  self  timed,  2)  interrogated,  and,  3)  random  (6). 

In  self-timed  mode  users  may  transmit  during  assigned  time  inter¬ 
vals  or  slots  of  the  order  of  one  minute  duration  (6).  Transmission 
intervals  are  controlled  by  precise  timers  within  each  DCP,  which 
minimize  the  possibility  of  transmission  collisions  resulting  in  lost  or 
erroneous  data.  In  general,  this  allows  each  platform  to  transmit  every 
few  hours.  Although  this  mode  of  operation  simplifies  data  management 
tasks,  only  a  limited  number  of  platforms  may  share  a  single  channel. 
Furthermore,  since  the  precise  time  and  order  in  which  the  self-timed 
DCP's  report  is  predetermined,  no  user  flexibility  is  afforded  by  the 
system  to  adapt  in  real  time  to  changing  environmental  conditions  (6). 
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Interrogated  UCP's  are  designed  to  transmit  in  response  to  signals 
generated  at  the  Command  and  Data  Acquisition  station  and  relayed 
through  GOES  to  the  network  (see  Figure  1.2).  Each  platform  is  assigned 
a  unique  address  which  is  carried  over  an  interrogation  channel  continu¬ 
ously  monitored  by  all  DCP's  of  this  type.  Upon  receiving  its  address, 
the  polled  DCP  transmits  its  message  on  an  assigned  reply  channel.  This 
transmission  mode  allows  for  greater  user  flexibility  than  does  the  self 
timed  mode  and  high  channel  use  efficiency  while  maintaining  a 
comparable  or  better  success  rate  for  each  transmission  (6).  In 
addition,  the  interrogated  system  has  an  event  generated  alert  system 
which  can  be  triggered  by  a  measurement  parameter  exceeding  a  pre-set 
threshold.  When  this  is  received  in  the  NESS  ground  system  and,  within 
approximately  60  seconds,  the  DCP  is  interrogated  and  the  special 
information  is  transmitted  via  the  normal  reply  channel.  Also,  special 
interrogation  schedules  can  be  implemented  as  a  result  of  receipt  of 
these  alert  messages.  Increased  flexibility  is  afforded  with  regard  to 
network  size  and  reporting  frequency.  Networks  are  limited  only  by 
address  length  and  message  duration.  This  capability,  however,  is 
achieved  at  the  expense  of  installing  high  performance  receivers  at  each 
platform.  This  receiver  significantly  impacts  DCP  complexity,  power 
consumption  and,  therefore,  total  costs  (6).  Furthermore,  a  more 
complex  data  management  and  analysis  procedure  is  required. 

A  variant  of  the  self  timed  and  interrogated  transmission  modes  is 
also  available  which  utilizes  a  satellite  controlled  timing  mechanism. 


Under  this  mode  of  operation,  termed  self  timed  with  satellite  control¬ 
led  clock,  an  interrogation  address  code  is  multiplexed  with  a  National 
Bureau  of  Standards  time  code  permitting  each  DCP  to  more  accurately 
determine  the  precise  time  of  day  (6).  In  self  timed  operation,  this 
feature  allows  for  a  reduction  in  the  required  duration  of  time  slots, 
which  can  approach  that  of  the  actual  transmission  duration,  thereby 
increasing  potential  channel  throughput.  However,  this  mode  also 
requires  high  performance  receivers  coupled  with  intelligent  clock 
controllers  at  each  platform  (6).  Therefore,  a  similar  set  of 
disadvantages  exist. 

The  most  recently  developed  mode  of  data  telemetry  available  to 
GOES  DCS  users  is  random  reporting.  One  principal  advantage  is  that  no 
requirement  exists  for  timing  of  transmissions  (6).  This  eliminates  the 
need  for  precise  synchronization  of  timing  instruments  in  DCP  networks. 
Another  advantage  is  that  transmissions  may  be  initiated  as  a  reaction 
to  changing  external  environmental  conditions.  Further,  some  platforms 
can  be  programmed  to  transmit  at  rates  dependent  on  environmental 
conditions.  This  capability,  termed  adaptive  random  reporting,  is 
extremely  valuable  to  users  where  the  timeliness  of  information  is 
critical  to  the  decision  making  process.  In  essence,  with  random 
reporting,  data  transmission  timing  and  frequency  can  be  influenced 
largely  by  the  users  measurement  requirements.  A  third  advantage  is 
that  random  reporting,  obviates  expensive  receivers  at  platforms. 

Random  reporting  DCP's  must  share  channels'  limited  transmission 
capabilities.  Since  each  platform  on  a  channel  transmits  without  regard 
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to  transmissions  from  other  OCP's  sharing  the  same  channel,  there  exists 
the  possibility  of  lost  data  due  to  message  collisions.  Moreover,  data 
management  tasks  at  the  receiving  station  are  somewhat  more  complex, 
inasmuch  as  message  arrival  times  are  not  predictable. 

1.2  Problem  Statement 

High  demand  exists  for  the  development  and  implementation  of  real 
time,  adaptive,  random  reporting  data  networks  (8).  As  will  be  shown  in 
the  following  chapter,  there  are,  in  fact,  many  systems  already  in 
operation,  each  with  definite  plans  for  expansion.  Several  of  these 
present  and  future  users  are  participating  in  a  NESS  sponsored  experi¬ 
ment  to  characterize  the  actual  operational  characteristics  of  the 
random  reporting  systems. 

Guidelines  have  been  developed  to  help  in  the  design  of  random 
reporting  DCP  networks  (6).  They  assume  that  fixed  reporting  rates  for 
all  stations  are  known.  Given  those  assumptions  the  guidelines  (see 
Chapter  3)  provide  relationships  between  probability  of  message 
reception,  number  of  DCP's  in  the  system  and  average  reporting  rate  at 
the  time  interval  of  analysis. 

The  Corps  of  Engineers  and  its  New  England  Division  have  realized 
that  in  practice  the  above  guidelines  are  applicable  to  a  hypothetical 
single  user  of  a  satellite  reply  channel  with  perfectly  known  reporting 
rates.  In  reality,  in  the  near  future,  many  users  (e.g..  Corps 
Districts)  in  different  geographical  regions  will  be  sharing  one  of  the 
few  allotted  satellite  channels  for  random  reporting.  Presently,  there 
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are  only  three  channels  on  each  of  the  East  and  West  satellites  allotted 
to  random  reporting.  Since  each  station  in  each  geographical  region  is 
affected  by  different  climatic  conditions,  the  actual  average  reporting 
rate  of  each  region  varies  for  different  instances  and  is  different  for 
every  user.  In  fact,  reporting  rates  are  a  random,  climatically  driven 
condition.  Given  this  setting  it  would  be  unnecessarily  conservative  to 
assume  that  all  stations  in  the  conterminous  United  States  are  reporting 
at  their  highest  possible  rate.  Under  such  situations  available  channel 
would  be  able  to  handle  but  a  fraction  of  the  projected  number  of 
stations  and  users  with  any  acceptable  level  of  reliability  of  message 
reception.  On  the  other  hand,  any  single  user  cannot  ignore  in  fact 
that  his  message  reliability  will  depend  on  the  reporting 
characteristics  of  his  channel  partners. 

The  goal  of  this  work  is  then  to  provide  a  set  of  guidelines  for 
the  design  of  DCP  networks  that  will  explicitly  account  for  geograph¬ 
ically  and  climatically  different  users.  Such  guidelines  will: 

1)  explicitly  consider  precipitation  variability  over  the 
conterminous  United  States; 

2)  provide  tools  and  procedures  that  will  allow  the  efficient 
allocation  of  partners  to  a  satellite  reply  channel; 

3)  satisfy  NESS's  rules  on  random  reporting  and  the  achievement 
of  pre-specified  levels  of  successful  message  transmission  for 
each  user. 


CHAPTER  2 


Adaptive  Random  Reporting:  A  State  of  the  Art  Review 

2.1  Adaptive  Behavior  of  Data  Collection  Platforms 

As  previously  indicated,  one  of  the  main  advantages  of  random 
reporting  is  the  ability  to  obtain  data  at  a  rate  dependent  on  and 
driven  by  the  parameters  being  monitored.  This  is  called  adaptive 
reporting.  The  New  England  Division  (NED)  of  the  Corps  of  Engineers 

(C  of  E)  specified  the  design  of  its  DCP's  with  a  particular  adaptive 
algorithm  responding  to  streamflow  and  rainfall  rates.  The  form  of  this 
algorithm  is  common  among  available  equipment.  Following  is  a  brief 
discussion  of  the  specifications  NED'S  Data  Collection  Platforms  which 
were  built  by  Synergetics  International,  Inc. 

The  DCP's  should  be  suited  to  data  collection  in  a  variety  of 
fields,  such  as  hydrology,  meteorology,  environmental  quality  and 
geology;  and  capable  of  operating  in  either  of  two  mr  es,  the  conven¬ 
tional  self  timed  mode,  or  the  random  reporting  mode  (7).  In  the  random 
reporting  mode,  the  DCP's  must  perform  the  following  tasks: 

1)  Monitor  environmental  parameters  by  sampling  at  intervals 
which  shall  be  user  selectable  from  seconds  to  hours. 

2)  Calculate  past  time  derivatives  in  the  sampled  parameters. 

3)  On  the  basis  of  these  parameters,  their  past  derivatives,  and 
user  selectable  threshold  values,  calculate  randomized  trans¬ 
mission  intervals. 
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4)  Format  the  latest  value  of  the  sampled  parameters  into  either 
ASCII  or  binary  numbers  containing  integral  multiples  of 

six  bits.  Twelve-bit  binary  words  will  be  sufficiently  large 
for  the  applications  presently  being  planned;  however,  the 
number  of  six-bit  multiples  in  each  parameter  must  be 
user-selectable.  Codes  other  than  binary  and  ASCII  must  be 
attainable  by  future  software  modifications. 

5)  Transmit  the  coded  data  in  a  sequence  of  ji  standard  GOES 
random  messages,  where  ji  may  be  1,  2,  3,  up  to  approximately 
10.  The  number  of  messages  (n)  must  be  user  selectable  and 
is  variable  so  as  to  permit  user  manipulations  which  can 
maximize  probability  of  reception  of  certain  emergency 
data.  (7) 

The  message  format  for  random  transmissions  will  have  the  following 


format  (7). 

Message 

Sub-Section 

Message 

Length  (sec) 

Contents 

Clear  Carrier 

.5 

— 

Clock 

.48 

48  alternating 

1 's  and  0's 

Maximal  Length  Sequence 

,1b 

100010011010111 

BCH  Identifier 

.31 

31  bit  DCP 

identification 

number 

Header 

.08 

Set  by  users 

Data 

.48  (nominal) 

8-bit/by te  GOES  ASCII 

EOT 

.08 

ASCII  EOT,  00000100 
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The  proposed  algorithm  which  a  OCP  must  use  for  calculating  trans¬ 
mission  rate  must  be  equivalent  to: 

RATE  =  MAX [BASE  RATE,  (A*CHANGE  IN  PARAMETER)]  (1) 

i.e.,  the  rate  is  the  larger  of  these  two  expressions:  a  user  selected 
BASE  RATE;  or  the  product  of  a  slope  factor  "A"  times  the  absolute  value 
of  the  change  in  parameter  value,  for  NED  this  parameter  is  river  stage 
(1).  Depending  upon  the  current  stage,  the  BASE  RATE  will  be  one  of 
three  pre-selected  values  corresponding  to  low  flow,  alert,  and  flood 
conditions.  The  slope  factor  "A"  will  serve  to  control  transmission 
rate  during  periods  in  which  conditions  are  changing  rapidly.  In 
addition,  a  DCP  may  not  change  from  short  transmission  intervals  to 
longer  intervals  until  it  has  transmitted  a  set  number  of  times  at  the 
shorter  interval.  This  capability  is  termed  "momentum"  (1).  The  base 
rate  and  the  slope  fraction  will  allow  tailoring  of  transmission  rates 
to  local  conditions. 

Alternative  reporting  algorithms  have  been  developed.  Most 
commercially  available  DCP's  also  operate  in  an  adaptive  random  mode 
based  upon  input  from  multiple  sensors  with  the  following  algorithm: 

RATE  =  BASE  RATE  +  (A*  CHANGE  IN  PARAMETER  )  (Z) 

As  in  the  previous  algorithm,  BASE  RATE  can  generally  have  three  values 
corresponding  to  low,  alert  and  warning  levels.  These  may  be  selected 


for  one  or  more  individual  sensor  inputs.  Commonly,  three  values  of 
"A",  the  slope  ''actor,  are  also  permitted.  Furthermore,  parameter 
groups  specifying  the  data  transmitted  from  multiple  sensors  based  upon 
a  single  driving  parameter  are  user  selected  in  some  equipment.  Al¬ 
though  these  capabilities  could  improve  the  data  collection  potential 
from  a  single  DCP,  network  performance  characteristics  would  not  be 
identical  to  those  governed  by  the  NED  algorithm. 


2.2  Existing  Data  Collection  Platform  Capabilities 

In  order  to  devise  a  useful  and  relevant  analysis  of  the  perfor¬ 
mance  characteristics  of  networks  of  random  reporting  data  collection 
platforms,  it  is  first  necessary  to  understand  the  characteristics  of 
these  devices,  and  the  manner  in  which  they  are  likely  to  be  employed. 
This  information  provides  a  basis  for  structuring  models  of  network 
performance. 

Currently,  there  are  several  companies  which  manufacture  satellite 
linked  data  collection  platforms  capable  of  "random"  reporting.  Some  of 
them  are: 

LaBarge  Electronics 

Handar,  Inc. 

Synergetics,  Inc. 

Sutron  Corporation 

Magnavox 

American  Electronic  Laboratories,  Inc. 

The  first  four  companies'  device(s)  were  contacted.  Their  capabilities 
are  discussed  in  the  paragraphs  that  follow. 
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LaBarge  manufactures  two  different  platform  models,  the  convertible 
DCP  (CDCP)  and  the  Advanced  DCP  (AOCP).  The  CDCP  has  8  analog  and  4 
digital  sensor  capacity  while  the  ADCP  has  12  analog  and  8  digital 
sensor  capacity.  Both  are  capable  of  transmitting  "emergency" 
information  on  a  secondary  channel.  A  single  parameter  is  monitored 
until  the  rate  of  change  of  the  parameter,  or  its  level,  exceeds  a 
prespecified  threshold.  Once  this  occurs,  the  platform  begins 
transmitting  at  a  fixed  rate  on  the  emergency  channel.  It  is  important 
to  note  that  randomness  is  introduced  only  by  the  variable  starting  time 
of  emergency  channel  transmissions.  Once  activated  the  emergency 
channel  transmits  at  a  pre-determined  rate. 

Handar  currently  manufactures  two  platforms  capable  of  random 
reporting.  Once  device  is  an  adaptation  of  their  Model  524,  the  so- 
called  524/B.  This  microprocessor  based  system  can  be  configured  to 
accept  up  to  8  analog  and  4  digital  inputs.  It  operates  in  a  manner 
similar  to  the  LaBarge  platforms.  Once  a  specified  parameter  exceeds  a 
preset  rate  of  change  or  level,  the  platform  can  be  programmed  to  begin 
transmitting  on  the  secondary  channel.  The  transmission  rate  is  equal 
to  the  sensor  scan  rate  --  Handar  has  the  added  capability  of  repeating 
messages  up  to  3  times  for  transmissions  made  on  this  secondary  channel. 
Like  the  LaBarge  unit,  once  the  threshold  event  has  occurred,  the  unit 
transmits  at  a  fixed  rate.  Thus  the  only  randomness  is  that  introduced 
by  the  triggering  event.  If  the  unit  is  programmed  to  operate  in  both 
self  timed  and  random  mode,  Handar  labels  this  "random  reporting";  and 
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if  the  unit  is  proyrammed  to  operate  only  on  the  random  channel,  Handar 
labels  this  "random  adaptive  reporting".  Handar  calls  a  self-timed  or 
slotted  reporting  regime,  using  a  short  message  format,  "self-timed 
emergency  reporting". 

The  second  Handar  data  collection  platform  capable  of  random 
reporting  is  their  recently  introduced  Model  560  Multiple  Data  Access 
Hydrological  System.  This  system  can  accept  up  to  18  separate  analog 
and  12  digital  inputs.  Handar  uses  signal  conditioning  cards  to  inter¬ 
face  with  a  variety  of  sensors.  The  unit  is  capable  of  preprocessing 
observed  data  computing  such  statistics  as  mean,  variance,  minima, 
maxima,  histograms,  rates  of  change,  differences  between  sensor  observa¬ 
tions,  and  scaling  of  data.  In  addition  to  self  timed  modes  of  opera¬ 
tion,  Handar  says  that  their  instrument  is  capable  of  alert  reporting 
(random  or  fixed  time  offset),  random  reporting,  and  random  adaptive 
reporting.  They  claim  that  their  platform  is  configured  to  randomly 
report  exactly  in  the  manner  specified  by  the  NESS  random  reporting 
Users  Guide  [6].  It  appears  that  the  random  reporting  mode  may  be  triggered 
independently  by  any  of  the  sensors.  Each  sensor  has  an  alert  level,  a 
warning  level,  and  a  slope  factor  which  can  be  programmed.  The  platform 
has  a  single  base,  alert,  and  warning  rate  assigned.  It  is  not  clear 
whether  each  sensor  channel  can  initiate  random  reporting  independently, 
or  whether  the  platform  rate  is  determined  by  the  highest  specified 
sensor  rate,  nor  is  it  clear  as  to  how  the  reporting  time  is  randomized. 
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Synergetics  is  the  most  recent  entrant  into  the  fiela  of  random 
data  collection  platform  development  with  their  3400  Series  Hydrological 
Data  Collection  Platform  system.  This  system  is  a  modular,  microproces¬ 
sor  based  system  which  includes  a  master  control  module,  a  GOES 
transmitter  module,  a  power  supply,  and  a  hydrological  sensor  interface 
module.  The  hydrological  sensor  module  is  configured  to  handle  8  analog 
signals,  4  digital  sensors,  1  up/down  counter,  and  1  up  counter  channel. 
Reportedly,  a  single  platform  could  handle  10  to  15  hydrologic  sensor 
modules,  with  14  channels  each.  In  addition,  the  master  control  module 
has  14  internal  channels  which  monitor  the  system  state,  and  which  can 
also  be  transmitted.  Synergetics  is  currently  also  developing  a 
meteorological  sensor  interface  module.  They  have  stated  that  other 
signal  conditioning  modules  will  be  developed  in  the  future. 

For  each  channel  selected  to  be  adaptive,  the  user  typically  inputs 
3  rates,  2  breakpoints,  and  1  slope  factor.  Each  time  the  sensors  are 
scanned,  the  reporting  rates  for  the  adaptive  channels  are  calculated 
as  the  maximum  of  the  rate  of  parameter  change  multiplied  by  the  slope 
factor,  and  the  base  reporting  rate  for  the  range  of  interest.  There  is 
a  single  platform  reporting  rate,  which  is  selected  as  the  maximum  of 
the  calculated  reporting  rates  for  all  of  the  adaptive  channels.  Once 
the  platform  reporting  rate  is  determined,  say  once  every  t  seconds,  a 
random  number  is  drawn  from  a  uniform  distribution  with  limits  0  to  2t, 
which  is  the  selected  distribution  on  random  reporting  interval.  The 
random  interval  is  added  to  the  time  of  last  transmission.  If  the 
current  clock  time  is  greater  than  or  equal  to  this  calculated  time  of 
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next  transmission,  then  the  platform  immediately  reports.  If  the 
calculated  time  of  transmission  is  after  the  current  clock  time,  but 
before  the  next  scheduled  sensor  scan,  then  the  unit  will  program  itself 
to  scan  the  sensors  and  report  at  the  calculated  transmission  time. 
Finally,  if  the  calculated  reporting  time  is  after  the  next  scheduled 
sensor  scan,  then  no  random  report  will  be  initiated,  but  instead  a  new 
calculated  reporting  time  will  be  computed  at  the  time  the  sensors  are 
next  scanned.  Additionally,  Synergetics  introduces  a  concept  of  momen¬ 
tum  --  which  is  simply  the  requirement  that  the  mean  of  the  distribution 
on  random  reporting  interval  not  decrease  faster  than  some  predetermined 
rate. 

Several  points  are  noteworthy  concerning  the  Synergetics  strategy. 
First,  the  reporting  times  are  explicitly  randomized  —  however,  there 
is  a  tendency  for  the  unit  to  report  at  times  coincident  with  the  scan 
interval,  namely  when  the  calculated  reporting  time  is  less  than  the 
current  time.  Second,  depending  on  the  way  the  user  programs  the 
platform,  a  variety  of  sensors  may  be  controlling  the  platform  reporting 
rate  --  in  a  relatively  difficult  to  predict  manner.  Thirdly,  as  a 
microprocessor  based  platform,  there  is  a  great  deal  of  flexibility  that 
can  be  programmed  into  the  hardware.  Simply  changing  PROM's  with  new 
instruction  sets  can  in  the  future  radically  alter  the  manner  in  which 
these  given  hardware  devices  will  perform.  This  is  true  for  all  brands. 

The  final  data  collection  manufacturer  identified  as  producing  a 
platform  capable  of  random  reporting  is  the  Sutron  Corporation  who 
manufactures  the  Model  8004B.  This,  like  the  Synergetics  and  possibly 
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Handar  platforms,  is  a  powerful  microprocessor  based  system.  The 
microprocessor  firmware  controls  the  platform's  reporting  in  self  timed 
and/or  adaptive  random  modes  on  two  separate  frequencies.  A  total  of  16 
sensors  may  be  monitored,  in  any  combination  of  analog  and  digital 
signals.  Each  of  these  monitored  parameters  may  be  assigned  to  one  of 
four  groups.  Each  group  is  scanned  independently  at  preprogrammed 
times;  and  depending  on  available  memory  and  groupings,  up  to  180 
samples  may  be  stored  for  the  parameters  in  the  group.  One  of  the 
parameters  assigned  to  each  group  is  used  to  control  the  adaptive 
reports.  Two  threshhold  levels,  and  three  base  reporting  rates  and 
slopes  are  defined  for  that  parameter.  Each  time  the  sensors  in  the 
group  are  scanned,  a  new  mean  adaptive  reporting  rate  is  calculated  as 
the  sum  of  the  rate  of  parameter  change  multiplied  by  the  slope  factor, 
and  the  base  reporting  rate  for  the  interval  of  interest  (as  an  option, 
Sutron  offers  to  compute  reporting  rate  as  the  maximum  of  the  rate  of 
change  multiplied  by  the  slope  factor  and  the  base  rate  for  the 
interval,  probably  involving  a  minor  firmware  modification).  Thus, 
there  is  a  separate  mean  adaptive  reporting  rate  for  each  of  up  to  four 
groups.  Once  the  group's  reporting  rate  is  determined,  say  once  every 
"t"  seconds,  a  random  number  is  drawn  from  a  uniform  distribution  with 
limits  0.5t  to  1.5t,  which  is  the  selected  distribution  on  random 
reporting  interval.  Note  that  there  are  time  periods  when  (it  can  be 
shown)  that  the  platform  will  not  report.  Note  also  that  with  each 
group  reporting  independently,  the  platform  would  behave  like  four 
separate  platforms.  Thus  a  user  configuring  a  platform  with  two  sensors 
in  a  single  group  is  imposing  potentially  significantly  less  burden  on 
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the  satellite  system  than  a  configuration  having  the  sensors  in  two 
separate  groups. 

Sutron  has  the  greatest  experience  in  manufacture  of  random 
reporting  data  collection  platforms  and  has  pioneered  much  of  the  work 
in  the  area.  In  addition,  they  have  made  research  contributions  with 
studies  for  the  New  England  Division,  Corps  of  Engineers,  and  with 
participation  in  writing  the  NESS  Users  Guide  to  Random  Reporting  (6). 

in  summary,  there  are  significant  differences  in  the  manner  in 
which  the  several  equipment  manufacturers  have  translated  the  theory  of 
random  reporting  to  practical  application.  The  LaBarge  and  Handar  524/B 
platforms  are  not  explicitly  random  --  "randomness"  is  introduced  only 
by  the  manner  in  which  they  scan  their  sensors,  and  the  attainment  of 
threshhold  levels  triggering  reporting  at  a  fixed  rate.  The  Synergetics 
and  Sutron  platforms,  and  possibly  the  Handar  Model  560/B,  explicitly 
randomize  their  reporting  times.  The  manner  in  which  they  specify  the 
distributions  on  random  reporting  interval,  and  incorporate  corollary 
concepts  such  as  momentum,  suggests  that  individual  platforms  quite 
likely  violate  the  NESS  Random  Reporting  Certification  Standard 
requirement  that  reporting  times  "shall  be  uniformly  random  within  the 
reporting  interval."  The  extent  to  which  this  leads  to  violation  of  the 
Poisson  arrivals  assumption  for  networks  of  platform  message  arrivals  at 
the  satellite  remains  to  be  examined.  Furthermore,  platform 
manufacturers  have  taken  a  variety  of  approaches  to  the  manner  in  which 
sensors  are  scanned,  grouped,  and  sensed  parameters  used  to  control  the 
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rate(s)  at  which  the  platform  reports.  Careful  assumptions  about  the 
manner  in  which  rates  are  determined,  and  perhaps  limitations  on 
allowable  regimes,  must  be  developed  for  any  meaningful  analysis  of 
network  performance.  Finally,  and  in  a  related  vein,  it  must  be 
recognized  that  the  state  of  the  art  platforms  are  relatively  powerful 
microprocessor  computers  --  and  can  be  configured,  programmed,  and 
operated  in  relatively  complex  ways  which  are  difficult  to  predict.  Two 
basic  strategies  are  possible  for  developing  good  models  for  predicton 
of  performance  of  networks  of  such  units.  First,  one  might  endeavor  to 
build  relatively  elaborate  "simulation"  models  to  capture  the 
complexity.  This  would  be  very  costly  and  impractical  in  that  it 
requires  the  analyst  to  forecast  the  complex  configurations  that  are 
possible.  Alternatively,  the  analyst  might  develop  simplified  analytic 
models.  Great  care  must  be  taken,  in  this  case,  to  insure  that  the 
actual  systems  implemented  are  constrained  so  as  not  to  grossly  violate 
the  assumptions  used  in  predictive  models  for  network  performance. 


2.J  Random  Reporting  System  Users 

A  number  of  government  user  groups  are  currently  using  or  are 
likely  in  the  near  future  to  begin  using  adaptive  random  reporting. 

These  include: 

Army  Corps  of  Engineers; 

Geological  Survey 

Bureau  of  Reclamation; 

National  Weather  Service; 

Soil  Conservation  Service; 

Bureau  of  Land  Management; 

Forest  Service; 

lennessee  Valley  Authority. 

These  users  are  distinguished  by  factors  including  the  numbers  of 
platforms,  types  of  platforms,  monitored  parameters,  grouping  of 
reported  parameters,  algorithms  for  reporting  rate  determination,  and 
reception  capabilities.  A  number  of  current  or  candidate  user  group 
programs  are  described  below. 

Perhaps  the  most  advanced  user  is  the  Bureau  of  Reclamation  in 
Boise,  Idaho.  This  group  started  installing  platforms  in  July,  1980. 

The  system  is  a  turnkey  installation  developed  by  Sutron.  Currently,  66 
platforms  are  operational,  as  is  a  ground  receive  station.  Roughly  12 
platforms  are  at  reservoirs,  and  measure  parameters  including  forebay 
elevation,  stream  elevation,  precipitation,  and  discharge.  Another  12 
platforms  are  purely  meteorologic  stations,  measuring  parameters  includ¬ 
ing  precipitation,  temperature,  soil  moisture,  and  water  content  of 


snowpack.  The  remaining  platforms  are  at  stream  gaging  stations,  and 
primarily  monitor  stream  levels  and  total  precipitation. 

The  platforms  in  use  are  the  Sutron  Model  8004B,  described  in  the 
previous  section.  All  are  configured  to  report  on  both  a  self-timed  and 
a  random  channel.  On  the  self  timed  channel,  each  reports  every  3 
hours.  Parameters  are  scanned  every  15  minutes.  The  random  reporting 
algorithm  varies,  depending  on  the  platform.  Up  to  three  group 
assignments  are  used  --  thus  the  effective  number  of  platforms  is  larger 
than  the  66  units  in  the  field.  Relatively  little  effort  has  been 
expended  attempting  to  forecast  channel  performance,  as  the  Bureau  has  a 
fully  dedicated  channel,  so  there  has  been  no  worry  about  adverse 
affects  on  other  users. 

This  solo  operation  is  currently  being  changed.  The  Bureau  is  in 
the  process  of  switching  to  Channel  128,  which  it  will  share  with  the 
Corps,  Missouri  River  Division,  and  the  Bureau  of  Reclamation,  Amarillo, 
Texas.  The  Boise  group  anticipates  addition  of  another  40  platforms  on 
the  Snake  River  within  a  year;  also  an  additional  20-25  platforms  in  the 
Deschute  River  Basin.  The  Bureau  in  Boise  has  done  little  ex  post  facto 
study  of  its  network  performance.  They  do  report,  however,  that  during 
one  flood  event  in  the  spring  of  1981,  40  of  the  66  platforms  became 
active  in  a  random  reporting  mode,  and  they  were  receiving  upwards  of 
2200  successful  random  reports  per  day. 

A  second  group  which  is  currently  using  random  reporting  is  the 
Corps,  Missouri  River  Division.  A  total  of  216  sites  have  been 
designated  for  satellite  data  collection.  The  Omaha  District  within  the 
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division  currently  has  8  operational,  and  28  nearly  operational 
platforms  (as  of  January,  1981).  The  Kansas  City  District  has  6 
platforms  operational.  Ultimately,  the  Omaha  District  will  have  110 
platforms  (30  additional  in  FY  82,  another  26  in  FY  83,  and  a  final  18 
in  FY  84).  Most  of  these  platforms  will  report  both  precipitation  and 
stage.  Currently,  using  the  Sutron  platforms,  stage  is  the  driving 
parameter  for -random  reports,  and  both  parameters  are  reported  in  a 
single  group.  Ultimately,  the  Kansas  City  District  will  have  106 
platforms  (50  additional  in  FY  83,  another  25  in  FY  84,  and  a  final  25 
in  FY  85).  The  Missouri  River  Division  is  participating  in  the  NESS 
random  reporting  experiment,  sharing  Channel  128  with  the  USBR  Boise. 
They  have  a  Memorandum  of  Understanding  with  the  Bureau  that  enables 
them  to  use  the  Bureau's  ground  station  until  1984. 

A  third  group  which  is  currently  using  random  reporting  is  the 
Corps,  Tulsa.  The  District  of  the  Southwest  Division  plan  ultimately  to 
install  132  platforms.  Currently,  they  have  38  platforms  authorized. 

Of  these,  10  will  be  Handar  model  524* s,  and  the  remainder  will  be 
Sutron  8004B  platforms.  Less  than  12  of  the  38  are  currently 
operational.  All  38  units  will  have  stream  and  precipitation  sensors. 
About  one  third  of  the  38  will  have  additional  parameters,  although 
there  are  no  detailed  specifications  thus  far.  This  office  of  the  Corps 
is  assigned  to  the  eastern  satellite,  Channel  129,  which  it  will  share 
with  the  New  England  Division.  Although  they  had  not  been  granted  final 
approval  (as  of  January  1982),  the  Tulsa  office  was  proposing  1.8  unit 
loads  per  platform  (a  definition  of  a.  unit  load  is  given  in  Chapters  3 
and  4)  during  the  NESS  test. 


A  fourth  group  which  will  begin  using  random  reporting  is  the  Corps 
of  Engineers,  New  England  Division. 

The  National  Weather  Service  currently  has  no  random  reporting  or 
interrogated  platforms,  although  they  have  ordered  a  few  for  testing  and 
evaluation.  The  Geological  Survey  is  reported  to  have  random  reporting 
platforms,  but  this  has  not  yet  been  verified.  For  NESS  test  purposes, 
the  Weather  Service  and  Geological  Survey  are  assigned  to  share  channels 
118  and  140.  According  to  NESS,  the  primary  interest  of  these  groups  is 
transmission  of  an  alert  warning  once  a  monitored  parameter  has  exceeded 
a  predetermined  threshhold  --  rather  than  the  continuous  real  time  data 
transmission  sought  by  the  Corps  and  the  Bureau  of  Reclamation. 

Groups  reported  to  have  random  reporting  capabilities,  but  which 
were  not  contacted,  include  the  Bureau  of  Land  Management,  Denver,  and 
the  Forest  Service,  Boise.  Groups  considering  developing  random 
reporting  capabilities  include  the  Corps,  Portland,  District  of  the 
Northwest  Division,  and  the  Tennessee  Valley  Authority.  They  are 
illustrative  of  the  scope  and  variety  of  hydro! ogic/meteorologic  random 
data  collection  users. 


CHAPTER  3 


Models  of  Data  Collection  Platform  Network  Communication 

3.1  Overview 

The  purpose  of  this  chapter  is  to  review  and  assess  relevant  model 
which  might  be  used  in  the  analysis  of  networks  of  random 
data  collection  platforms.  It  is  essential  to  keep  in  mind  the  obvious 
--  that  the  problem  is  simply  one  of  analyzing  communications.  These 
communications  are  between  distributed  users  and  a  central  site.  The 
fact  that  the  senders  are  microprocessor  controlled  data  collection 
platforms,  that  the  transmission  link  is  a  satellite,  and  that  received 
information  is  computer  routed,  should  not  disguise  the  simplicity  of 
the  underlying  problem. 

3.2  Network  Communication  Strategies 

Consider  a  network  like  that  shown  in  Figure  3.1.  In  the  most 
general  case,  each  node  can  function  to  generate,  receive,  store,  or 
route  transmissions.  Links  can  be  used  to  transmit  in  both  directions. 
Historically,  two  modes  of  network  communications  have  been 
distinguished: 

1)  circuit  switched  (pre-allocation) 

2)  packet  switched  (dynamic  allocation) 

Communicating  implies  transfer  of  information  between  selected 
origin-destination  pairs,  as  opposed  to  broadcasting,  which  would 
suggest  transmission  between  an  origin  and  all  possible  destinations  on 
the  network.  Each  of  the  two  modes  is  described  below. 


In  the  circuit  switched  pre-allocation  mode,  the  path  from  sender 
to  receiver  is  established  in  advance,  before  communicating  commences. 
Once  established,  this  path  or  circuit  is  maintained.  In  order  to 
efficiently  use  network  capacity,  this  scheme  requires  a  relatively 
strong,  centralized  assignment  control  mechanism.  The  advantage  to 
users  is  that  they  face  no  delays  in  communicating  their  information 
once  they  have  established  access  to  a  circuit.  No  information  storage 
is  required.  The  disadvantage  is  that  because  the  assignment  of 
circuits  cannot  readily  adapt  to  changing  demands,  the  network  capacity 
may  be  inefficiently  assigned  and  utilized.  Traditiona I ly ,  telephone, 
radio,  and  television  have  employed  circuit  switching. 

In  the  packet  switched  or  dynamic  allocation  mode,  “packets"  of 
information  may  be  sent,  stored,  sorted,  and  ultimately  routed  over  a 
path  in  the  network  to  their  destination.  The  path  is  not  pre-assigned, 
but  established  depending  on  conditions  in  the  network  at  the  time  of 
transmission  or  retransmission  of  data  packets  from  nodes  in  the 
network.  This  format  facilitates  relatively  decentralized 
communications  and  control.  One  disadvantage  is  that  delays  may  be 
encountered  as  information  is  stored  or  rerouted.  Further,  since  there 
may  be  no  centralized  control  of  access,  users  may  not  have  exclusive 
use  of  circuits  thereby  resulting  in  unreliable  communications. 

Examples  of  packet  switched  networks  have  included  telegraph  and  mail 
systems.  Computers  have  made  possible  high  speed  packet  switched 
communications,  by  greatly  enhancing  the  capacity  to  store,  sort,  and 


route  information  in  near  real-time.  Computer  controlled 
telecommunications  have  made  feasible  dynamic  allocation  systems  that 
in  some  aspects  are  superior  to  preallocation  systems  in  time, 
reliability,  economy,  and  flexibility  (9). 


3.3  Multiple  Access  Protocols 

When  a  multiplicity  of  users  can  have  access  to  a  shared  link  or 
path  in  a  communication  network,  rules  may  be  established  to  reduce  or 
eliminate  conflicts  between  users.  The  parameters  for  dividing  access 
include  time,  frequency,  and  encoding.  According  to  Lam  (14),  multiple 
access  protocols  have  traditionally  been  channel  oriented.  That  is,  the 
network  is  divided  into  separate  channels,  and  channels  assigned  on  a 
fixed  or  demand  basis. 

Three  distinct  protocol  classes  may  be  used  to  control  channel 
acccess: 

1)  reservation; 

2)  polling;  and 

3)  contention 
Each  class  is  described  below. 

The  reservation  protocol  seeks  to  eliminate  conflicts  between 
users.  If  reservations  are  static,  then  users  can  communicate  only  at 
specified  times,  on  specified  frequencies,  or  using  selected  codes. 
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If  reservations  are  to  reflect  user  requirements  in  a  dynamic  sense, 
then  two  issues  must  be  resolved.  First,  a  channel  must  be  established 
for  users  to  communicate  their  requirements  for  a  reservation.  Second, 
a  queue  of  user  reservations  must  be  maintained,  on  a  centralized  or 
distributed  basis.  Note  that  in  order  to  implement  adaptive  reservation 
systems,  users  must  be  able  to  both  send  and  receive  --  communications 
must  be  bidirectional. 

A  polled  or  interrogated  protocol  also  will  eliminate  conflicts 
between  users.  Users  are  given  access  to  a  channel  only  when 
interrogated  by  a  central  controller.  There  is  again  the  requirement 
that  users  have  the  ability  both  to  send  and  receive  signals. 

Generally,  depending  on  the  sophistication  of  the  central  controller, 
this  protocol  provides  a  fairly  efficient  mechanism  for  data 
acquisition.  However,  as  is  the  case  with  satellite  linked 
communication,  when  there  are  relatively  long  propagation  times  for 
communicating,  the  overhead  imposed  by  interrogation  of  users  can  become 
significant.  Note  that  in  a  pure  polling  system,  users  do  not  have  the 
opportunity  to  communicate  events  unless  polled. 

A  contention  protocol  does  not  seek  to  eliminate  all  conflicts 
between  users.  Each  user  independently  chooses  when  to  transmit, 
without  regard  to  other  users  who  may  be  transmitting  at  the  same  point 
in  time.  This  protocol  was  first  implemented  in  a  computer 
communications  network,  the  ALOHA  system,  at  the  University  of  Hawaii  in 


1y 7u  (14,  10,  11,  13,  lb).  Under  the  unslotted  ALOHA  protocol 
transmissions  are  unsynchronized  over  a  common  channel.  A  slotted  ALOHA 
protocol  simply  restricts  times  at  which  transmissions  can  be  initiated, 
but  otherwise  does  not  restrict  user  access.  Both  the  ALOHA  and  the 
slotted  ALOHA  protocols  were  developed  assuming  users  could  receive, 
thereby  getting  immediate  feedback  on  the  success  or  failure  of  their 
attempt  to  access  the  communication  channel.  The  protocol  is  feasible 
without  feedback,  but  in  this  case,  lost  transmissions  must  be  accepted 
or  data  repetition  methods  devised  to  insure  reception.  The  R-ALOHA 
protocol  is  a  variation  in  which  time  slots  are  organized  into  groups 
called  frames,  and  availability  of  a  particular  slot  depends  upon  the 
status  of  the  corresponding  slot  in  the  previous  time  frame.  The 
R-ALOHA  system  clearly  requires  bidirectional  communications.  A  similar 
hybrid  contention  model  known  as  the  URN  protocol,  which  is  adaptive, 
also  needs  feedback.  Note  that  the  contention  protocols  can  circumvent 
central  control  and  are  completely  user  initiated. 


3.4  Framework  for  GOES  DCS  Analysis 

The  GOES  system  as  a  packet  switched  system,  in  which  frequency 
bands  have  been  subdivided  into  channels,  is  well  suited  to  networks  of 
distributed  users  communicating  with  static  central  i-zed  control.  This 
sort  of  communications  is  made  practical  by  the  ability  of 
microprocessor  and  computer's  capabilities  to  quantify,  transmit  and 
store,  information  at  high  speed. 

When  multiple  users  have  access  to  a  single  network,  access  may  be 
divided  in  time,  frequency,  or  encoded  information.  The  GOES  system 
uses  discrete  channels.  Conflicts  in  time  on  a  single  channel  are 
avoided  by  one  of  the  three  types  of  multiple  access  protocols: 

1.  reservation  (self  timed) 

2.  polling  (interrogated) 

3.  contention  (random  access). 

NESS  has  elected  to  test  all  three  types  of  protocols.  Only  the  random 
access  protocol  allows  the  user  both  to  report  based  on  locally  observed 
events  and  to  avoid  the  significant  expense  of  receiving  capability. 

Although  a  number  of  articles  have  addressed  the  performance  of 
communication  networks,  the  basic  findings  of  Abramson  (11,  12,  15)  and 
others  (13,  14,  20)  are  especially  germane.  The  key  to  the 
applicability  of  these  results  in  analyzing  the  GOES  system  is  insuring 
that  the  system  is  compatible  with  the  assumptions  in  which  the  analysis 
are  predicated.  For  the  GOES  system,  the  regulations  and  certification 
standards  for  radio  sets  imposed  by  NESS  insure  that  most  of  the 
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asssumptions  can  be  satisfied  --  for  example,  transmissions  of  a  fixed 
length,  or  uniformly  random  message  starting  time.  Recall,  however, 
that  because  some  types  of  platforms  might  not  transmit  only  randomly, 
and  because  users  can  inadvertently  synchronize  transmissions,  NESS  must 
carefully  regulate  system  use  to  insure  that  key  assumptions  are  not 
violated.  The  rate  of  arrival  of  transmissions  at  the  satellite  is  an 
extremely  important  factor,  and  the  previous  assumptions  have  to  be 
satisfied. 

Although  the  literature  has  successfully  provided  models  of  network 
performance,  pure  contention  cases  in  which  undetected  transmission 
failures  can  occur  have  been  analyzed  only  in  the  NESS  User’s  Guide  (b). 
Moreover,  the  previous  analyses,  with  the  exception  of  some  limited 
simulation  studies  done  by  Sutron  Corporation  for  the  New  England 
Division,  Corps  of  tngineers  (4),  have  addressed  network  performance 
merely  assuming  given  platform  transmission  rates.  In  reality,  the 
platform  transmission  rates  are  a  function  of  sensed  climatic 
information,  platform  characteristics,  and  user  selected  input 
parameters.  This  transformation  of  sampled  data  to  distributed  reporting 
rates  is  non-trivial,  and  is  one  of  the  major  research  concerns  of  the 
current  project. 

Figure  3.2  presents  one  possible  representation  of  the  communica¬ 
tions  system  of  interest  to  the  Corps  of  Engineers  and  NESS.  One 
driving  force  for  system  performance  is  the  climate  to  be  sensed  and 
reported.  At  the  user  leve1,  a  network  of  platforms  is  installed  to 
provide  information  for  reconnaissance,  planning,  and  real  time  control. 
These  platforms  are  subject  to  constraints  imposed  by  the  authority 
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Figure  3.2:  Framework  for  data  collection  system  analysis 
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controlling  the  network  —  for  example,  message  formats  and  unit 
loadings.  At  the  control  level,  channel  access  and  assignments  are 
established.  A  number  of  measures  have  been  used  to  determine  the 
performance  of  such  a  system. 

These  measures  include  but  are  not  limited  to: 

*  reliability  —  the  rate  of  success  of  that  a  message  being 
received  without  interference; 

*  timeliness  —  the  delay  from  initial  transmission  attempt 
until  the  information  is  available  to  a  user; 

*  efficiency  —  the  level  of  utilization  of  channel  capacity, 
perhaps  as  measured  by  number  of  users  of  a  particular  type; 
and, 

*  equity  —  fair  allocation  of  resources  among  users. 

Given  climate,  platform  characteristics,  and  network  attributes,  one 
important  goal  is  to  forecast  network  performance  as  surnnarized  by 
selected  measures.  Given  the  inputs,  user,  and  control  decisions,  a 
descriptive  model  would  be  a  simulation  of  network  performance.  A  more 
difficult  problem  which  will  not  be  addressed  in  the  current  research  is 
that  of  optimization  of  network  performance.  That  is,  finding  the 
"best"  set  of  user  and  control  decisions  to  achieve  selected  network 
performance  goals.  The  difficulties  of  optimization  lie  in  that  the 
above  measures  of  performance  are  poor  surrogates  of  the  value  of 
information  during  the  ultimate  use  of  the  data. 
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descriptive  model  would  be  a  simulation  of  network  performance.  A  more 
difficult  problem  which  will  not  be  addressed  in  the  current  research  is 
that  of  optimization  of  network  performance.  That  is,  finding  the 
"best"  set  of  user  and  control  decisions  to  achieve  selected  network 
performance  goals.  The  difficulties  of  optimization  lie  in  that  the 
above  measures  of  performance  are  poor  surrogates  of  the  value  of 
information  during  the  ultimate  use  of  the  data. 
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3.b  Models  for  GOES  Network  Performance 

The  GOES  data  collection  system  introduced  in  Chapter  1  consists  of 
distributed  data  collection  platforms,  using  a  satellite  communications 
line,  to  transmit  sensed  data  to  a  central  ground  station  (or  stations), 
and  ultimately  to  end  users.  A  schematic  of  the  implied  communications 
network  is  shown  in  Figure  3.3. 

The  National  Earth  Satellite  Service  (NESS)  has  established  a 
number  of  conventions  which  regulate  the  manner  in  which  the  network  may 
be  accessed.  The  basic  strategy  has  been  that  of  packet-switching  or 
dynamic  allocation.  This  format  facilitates  relatively  decentralized 
communications  and  control.  Part  of  the  strategy  has  also  been  to 
divide  the  uplink  band  into  233  channels  —  so-called  frequency  division 
multiple  access.  Platforms  can  then  be  assigned  to  separate  reply 
channels,  thereby  to  a  certain  degree  eliminating  multiple  access 
conflicts,  at  least  in  the  frequency  domain. 

Currently,  NESS  is  using  pure  strategies,  i.e.,  all  platforms 
assigned  to  the  same  channel  are  subject  to  one  of  the  same  three 
protocols  described  in  the  previous  section.  The  above  NESS  protocols 
are  subsets  of  the  reservation,  polling,  and  contention  strategies 
described  earlier.  Note  that  the  links  between  user  and  the  ground 
station  have  not  been  closely  examined. 

The  reservation  or  self-timed  approach  restricts  data  collection 
platform  access  to  predetermined  time  slots.  Normally,  the  slots  are  of 
1  minute  duration,  and  platforms  are  restricted  to  8  slots  or  minutes 
per  day.  This  8-minute  time  allocation  per  platform  is  a  management 
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concept  for  allocating  time  on  channels  equitably,  and  is  known  as  the 
unit  load.  Typically,  a  self-timed  platform  does  not  have  to  receive 
signals  from  the  satellite,  provided  the  platform  contains  a 
sufficiently  accurate  clock,  so  that  the  timing  of  transmissions  is 
stable.  Note  that  the  static  reservation  system  selected  is  intended  to 
el iminateconfl icts  --  but  only  180  platforms  can  be  assigned  per 
channel,  and  there  is  no  adaptive  reporting. 

The  interrogated  system  allows  the  ground  station  to  poll  or  query 
users  when  a  report  is  required.  One  of  two  frequencies  is  employed  for 
polling  depending  upon  the  operational  satellite  being  used.  Naturally, 
each  data  collection  platform  must  have  the  ability  to  receive  satellite 
signals.  This  can  be  costly,  or  infeasible,  given  the  remoteness  of 
data  collection  sites  and  the  availability  of  reserved  or  random  access 
alternatives. 

The  random  access  protocol  available  to  GOES  users  allows  pure 
contention  among  users  for  channel  access.  Users  independently  decide 
to  transmit,  subject  to  the  constraint  that  the  total  time  of 
transmissions  per  random  reporting  platform  does  not  exceed  the  unit 
load.  Since  most  random  reporting  platforms  also  transmit  in  a  self 
timed  mode,  the  8  minute  access  time  is  further  restricted  to: 

a.  6  minutes  on  a  self-timed  channel, 

b.  2  minutes  on  the  random  access  channels  (average  of 
5  seconds  per  hour). 

The  message  formats  are  user  defined.  In  the  NED  application  the 
standard  message  is  approximately  3  seconds  long.  The  time  of  transmis¬ 
sions  is  taken  to  be  uniformly  random. 


Recall  that  because  each  platform  is  reporting  independently,  the 
possibility  exists  that  a  message  from  one  platform  may  collide  with 
those  of  other  platforms  sharing  the  same  channel.  Most  of  the  analyses 
of  ALOHA  type  systems  (see  previous  section)  assumed  bidirectional 
communications,  allowing  users  to  listen  and  hear  whether  their 
transmission  attempts  were  successful.  Should  an  unsuccessful  attempt 
be  detected,  the  user  could  always  retransmit.  Random  access  users  on 
the  GOES  system  are  not  equipped  to  receive  satellite  broadcast. 

Accordingly,  users  of  the  GOES  system  must  either  tolerate  lost 
transmissions,  or  adjust  their  transmission  rates  to  compensate  for  the 
lost  transmissions.  Adjustments  suggested  in  the  "Random  Reporting 
Users  Guide"  (6)  include  message  repetition  which  effectively  increases 
transmission  rates,  or  concentration  of  messages,  leading  to  larger 
message  units.  These  adjustments  are  critical  to  the  success  of  a 
unidirectional  random  reporting  system  which  requires  a  high  level  of 
reliability.  Analysis  of  this  type  of  pure  contention  system  has 
largely  been  in  relation  to  GOES  satellite  services. 

The  protocols  described  above  have  been  developed  to  achieve 
reliable  communications  in  a  multiple-access  mode  by  partitioning  time, 
and  are  referred  to  as  time  division  multiple  access  (TOMA).  An 
alternative  means  of  random  access  can  also  be  obtained  through 
frequency  division,  where  signals  are  partitioned  over  on  frequency 
spectrum.  This  is  termed  frequency  division  multiple  access  (FOMA). 

For  large  networks  in  which  users  (DCP's)  have  a  high  ratio  of  peak  to 
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average  data  telemetry  requirements,  such  as  that  of  most  event  based 
environmental  measurements,  the  TDMA  pure  contention  protocol  provides  a 
powerful  means  of  sharing  communications  resources  (10,  11,  14). 

Next  subsections  will  describe  analytical  performance  models  for 
the  two  most  popular  random  access  (contention)  protocols.  These  are 
the  slotted  and  unslotted  ALOHA  systems,  the  latter  corresponding  to  the 
mode  of  operation  selected  by  NED.  Emphasis  will  be  given  to  unslotted 
ALOHA's  and  methods  to  improve  their  performance. 

3.5.1  Analysis  of  Unslotted  ALOHA  Reporting  Schemes 

A  number  of  researchers  have  analyzed  performance  of  contention 
or  ALOHA  type  systems.  Abramson  (12)  provided  the  first  formal 
presentation  of  unslotted  ALOHA  channel  performance.  Work  by  Abramson 
(11,  12,  15),  Lam  (14,  20,  23),  Kleinrock  (26,  27,  28)  and  others  (13, 
16,  17,  18)  has  contributed  to  development  of  an  understanding  of  random 
reporting.  Recently,  the  Sutron  Corporation  (11)  and  the  Water  and 
Power  Resources  Services  (6)  have  developed  analyses  which  more 
comprehensively  address  the  GOES  network. 

Most  analyses  have  focussed  on  a  critical  period  (peak  loading)  of 
the  network.  The  following  are  assumed  to  be  known: 
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N  =  number  or  platforms  assigned  to  tne  channel. 

X^=  average  transmission  rate  from  platform  i  (transmissions/ 
second) 


N 

X  -  average  transmission  rate  from  N  platforms  =  £  x ^ 


T  =  duration  of  transmission. 

The  focus  has  been  on  the  network  of  platforms  on  a  channel  --  and 
not  the  manner  in  which  the  rates  are  determined  by  user  level  decisions 
or  climatic  inputs. 

A  transmitted  message  can  be  received  incorrectly  or  completely 
lost  due  to  two  different  types  of  errors:  (1)  random  noise  errors  and, 
(2)  errors  caused  by  message  overlap  (12).  Most  researchers  have 
concentrated  on  errors  of  the  second  type  and  the  same  approach  will  be 
taken  here.  Accordingly,  a  message  is  lost  if  transmissions  from  one  or 
more  platforms  collide  as  illustrated  in  Figure  3.4.  Define  D  as  the 
interval  between  transmissions.  If  a  single  DCP  sends  a  message  of 
duration  T  (T  much  smaller  than  D)  with  starting  time  uniformly 
distributed  in  D,  and  if  all  DCP's  act  independently  of  all  others,  then 
the  probability  of  collision  (failure)  is  the  ratio  of  collision 
interval  to  D. 


where  P  is  probability  of  failure  and  a  pair  of  DCP's  have  been 
considered  at  a  time. 
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Figure  3.4:  Collisions  in  unslotted  ALOHA  systems 
(adapted  from  10) 


The  complementary  probability,  that  of  the  pair  of  transmissions 


^  nut 


not  colliding  is: 


<-i-F 


(3.2) 


Equations  3.1  and  3.2  then  effectively  represent  success  and 
failure  probabilities  of  a  classical  Bernoulli  experiment  (one  where 
only  2  outcomes  are  possible).  If  there  are  N  stations ,“  the  probability 
of  no  failures  then  follows  the  well  known  Bernoulli  distribution.  We 
are  asking  for  no  collisions  in  N  (number  of  stations)  experiments.  Ihe 
result,  due  to  the  independence  of  stations,  is: 


Ps  =  qN  -  (1  -  fr) 


2T»N 


(3.3) 


where  is  the  probability  of  no  interference  among  the  N  stations  or 


effectively  tne  probability  that  any  station  will  successfully  transmit 
a  message.  The  complementary  probability  of  failure,  P^,  for  any 
station  in  a  field  of  N  is  then. 


Pf  -  1  -  (1  -  £[)N 


(3.4) 


The  term  of  the  form  (1  -  X)  in  the  aoove  equation  for  |X|  <1,  can  be 
expanded  in  a  series  as. 
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(1  -  X)N  =  1  -  XN  +  N(N-l)  -  -^,N(N-2)(N-2) 

X4 

+  |p  N(N-1 ) ( N-2 ) ( N-3 )  .... 

-XN 

If  N  is  large,  then  that  series  approximates  the  exponential  form  e 

2  3 

since  it  is  given  by  e~^  =  ^~X+-jjTN^-yr  +  ...  For  large  N, 
Equation  3.3  then  becomes: 

Ps  *  e  B  (3.5) 

and 

_|IN 

Pf  =  1  -  e  0  (3.6) 


Since  in  the  above  N  is  large  and  transmission  times  are  random, 
which  leads  to  random  times  between  transmissions,  then  we  can 
define 


X 


ii  a  l 

D  i£i 


Ai 


(3.7) 


which  can  be  interpreted  as  an  average  rate  of  transmission  of  the  network  or 
the  average  rate  at  which  the  satellite  receives  messages.  Using  3.7 
the  probability  of  successful  transmission  for  a  station  in  a  large 
network  is 


P 


s 


(3.8) 
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Probabilistically,  the  result  corresponds  to  the  asymptotic 
covergence  of  the  Binomial  distribution  (Equation  3.3)  into  the  Poisson 
distribution  (Equation  3.8).  As  the  number  of  stations  becomes  large 
the  arrivals  of  messages  at  the  satellite  becomes  a  Poisson  process  with 
rate  A.  The  probability  of  x  arrivals  at  the  satellite  in  time  t  is: 

P[x]  =  iiip -  (3.9) 

A  valid  question  is  how  large  N  has  to  be  to  make  the  Poisson 

2T 

approximation  valid.  This  depends  on  the  value  of  .  Since  T  is 

on  the  order  of  3  seconds  and  D  is  at  best  on  the  order  of  10  minutes, 

2T 

is  at  most  on  the  order  of  0.01.  The  following  table  illustrates  the 
approximation. 


N 

(1-.01 )N 

-.01N 

e 

1 

.99 

.99 

10 

.9044 

.9048 

20 

.8179 

.8187 

30 

.7397 

.74 

200 

.1340 

.135 

Clearly  the  results  are  extremely  good  over  a  wide  range  of  N  values. 
It  will  remain  so  as  long  as  2T/D  is  small. 


Given  the  above,  the  discussion  can  be  reoriented  to  the  point  of 
view  that  arrivals  of  messages  at  the  satellite  are  in  fact  Poisson 
distributed.  A  corollary  statement  is  that  the  time  between  arriving 
messages  is  exponentially  distributed 

f D ( D )  =  Ae“AD  (3.10) 

where  D  is  the  time  between  messages. 

In  order  to  re-develop  the  analysis  then  make  the  following 
assumptions  (6): 

1.  There  are  relatively  large  number  of  platforms  assigned 
to  the  channel,  N  >_  50  according  to  (6); 

2.  No  platform  uses  a  la.ge  amount  of  the  available 
transmission  time  [reference  (6)  suggests  TA^  <  0.1]; 

3.  The  average  rate  of  reception  at  the  satellite.  A- 
is  constant  for  the  time  period  of  interest: 

4.  Starting  time  of  transmission  arrivals  at  the  satellite 
is  statistically  independent  of  the  starting  time  of 
other  transmission  arrivals. 

5.  Transmissions  are  of  a  fixed  and  equal  length,  T. 

6.  Errors  due  to  random  noise  are  negligible,  but  if  a 
transmission  overlap  occurs,  all  information  is  lost. 


54 


i  1  v  ■  r 


T 


f« 


R 


Let: 


b  =  average  number  of  transmissions  attempted  in  T  seconds, 
the  channel  traffic  or  loading 
S  =  the  average  number  of  successful  transmissions  in  T 
seconds,  the  throughput 

?  -  the  probability  of  success  of  one  transmission. 


Then  by  definition 


G  =  AT 


(3.11) 


Also, 


Ps  =  S/G 


(3.12) 


That  is,  the  probability  of  success  is  simply  the  average  rate  of 
success  of  transmissions. 

Because  transmission  deviation  for  all  users  or  platforms  is 
assumed  constant,  a  single  transmission  starting  at  time  t  will  be 
successful  if  no  other  transmissions  occur  in  the  interval  t-T  to  t+T 


i 

.■J 


« 

(see  Figure  3.4).  That  is 
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where  P[A|Bj  stands  for  probability  of  A  given  that  B  occurs.  Since 
starting  times  of  messages  are  assumed  independent,  the  probability  of 
no  transmission  at  time  t  is  also  independent.  Therefore 

p  *  probability  no  transmissions 

s  f  1 

L  in  interval  1 


The  likelihood  of  this  event  depends  on  the  rates  and  distribution  of 
message  generation. 

But  from  Equation  3.9  the  probability  of  no  arrivals  in  interval  t 
is, 

P[X=0]  =  e'U  (3.13) 


Accordingly,  for  an  interval  of  duration  2T, 


(3.14) 


That  is,  the  probability  of  success  of  one  transmission  depends  only  on 
the  network  transmission  rate  and  message  duration.  Substituting  from 
the  definition  of  Equations  (3.11)  and  (3.12),  it  is  easily  shown  that 


(3.15) 
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a  no 


S  =  Ge"^  (3.16) 

That  is,  the  probability  of  success  and  the  channel  throughput  can 
easily  be  related  to  the  channel  loading. 

Figure  3.5  displays  equations  (3.15)  and  (3.16)  as  a  function  of 
channel  loading.  Observe  that  channel  throughput  is  maximized  at  0.184 
when  the  overall  channel  loading,  G,  equals  0.5.  At  this  level  of 
loading,  channel  utilization  efficiency  is  maximized;  however,  the 
network  is  relatively  unrel iable,  with  the  probability  of  success  of  a 
single  transmission  only  0.368.  Observe  that  Pg  decreases  monotonical ly 
as  channel  loading  increases. 

Figure  3.6  displays  the  tradeoffs  between  reliability  and 
throughput,  a  measure  of  efficiency.  Assuming  both  higher 
levels  of  reliability  and  throughput  are  preferred,  all  channel  loadings 
higher  than  0.5  are  dominated  --  there  exist  better  alternatives  at 
lower  loading  levels.  The  decision  maker  must  then  tradeoff 
high  reliability  for  efficiency,  and  vice  versa.  The  overall  channel 
loading  selected  must  represent  a  compromise  between  these  two 
conflicting  objectives. 

If  the  user  chooses  to  operate  at  a  channel  loading,  G,  of  0.5, 
then  it  is  possible  to  make  a  statement  on  the  maximum  allowable  number 
of  stations.  Remember  G  =  AT,  where 
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Figure  3.-5-J.  PROtJABILITY  OF  SINGLE  TRANSMISSION  ARRIVING  WITHOUT  INTERFERENCE 
AND  OVERALL  SUCCESSFUL  TRANSMISSIONS  IN  TIME  T  (S)  AS  A  FUNCTION 
OF  OVERALL  CHANNEL  LOADING  (6) 


N 

A  =  )  A,  =  NX 

i=l  1 

and  X  is  interpreted  as  an  average  transmission  rate  (transmissions  per 
unit  time)  for  all  N  stations.  Then  choosing  G  =  0.5  leads  to 

0.5  =  NIT 
or 


N 


max 


(2AT)"1 


(3.17) 


In  the  above  it  is  assumed  that  a)  A  is  a  known,  fixed  quantity;  b)  T  is 
the  same  for  all  stations;  and  c)  a  reliability  (probability  of  success 
ful  transmission;  of  only  0.368  is  acceptable. 

Figure  3.7  illustrates  Equation  3.17  for  different  transmission 
lengths  and  average  station  reporting  rates. 


3.5.2  An  Improvement  in  Performance:  Slotted  ALOHA  Systems 
The  unslotted  ALOHA  system  suffers  from  low  probability  of 
successful  transmissions  and  low  channel  utilization  in  terms  of 
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Average  time  between  DCP 
transmissions  (  hours  ) 


throughput.  Several  modifications  to  the  system  have  been  suggested.  A 
popular,  well  studied,  alternative  is  the  slotted  ALOHA  system  (11, 

10,  12,  14b).  The  principle  is  to  introduce  some  order  into  the  otherwise 
completely  random  ALOHA  system.  The  order  comes  by  dividing  time  into 
slots  of  duration  equal  to  the  transmission  length.  DCP's  are  then 
allowed  to  transmit  synchronously  with  the  beginning  of  a  slot. 
Otherwise,  DCP's  still  perform  independently  from  one  another.  The  result 
of  this  time  slotting  is  that  failure  or  loss  of  messages  occurs  only  by 
complete  and  exact  overlap  as  illustrated  in  Figure  3.8. 

Following  Abramson  (11),  the  analysis  of  this  system  follows. 

Define  as  the  probability  that  the  i^  DCP  will  transmit  in  a 
particular  slot.  Given  N  independent  DCP's,  the  total  normalized 
traffic  in  the  channel  is 

G  =  J  Gi  (3.18) 


1. 

Define  as  the  probability  that  the  i  DCP  is  the  only  one 
transmitting  in  a  given  slot,  .  The  total  normalized  channel 

throughput  is  then 


The  probability  of  the  ith  DCP  sending  a  message  and  successfully 
completing  it  among  the  N  independent  DCPs  is 


N 

" (1  -  V 


j=l 


(3.20) 


I 


which  is  just  the  product  of  the  probability  that  the  other  N-l  stations 
do  not  transmit  in  that  slot. 

Using  the  definition  of  ,  it  must  then  hold  that 


N 

Si  =  Gi  n  (1  -  Gj)  (3.21) 

j  =  l 
j*1 


But  if  all  users  are  identical,  from  (3.18)  and  (3.19) 


Gi  *  i  and  si  *  I 


so  3.21  yields 


S  =  G(1  -  f)N_1  (3.22) 


which  as  N  +  »  results  in 
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$  =  Ge"G  (3.23) 

Notice  that  if  the  Poisson  message  arrival  assumption  is  accepted. 
Equation  (3.23)  could  have  been  arrived  at  very  quickly.  Total 
throughput,  S,  must  be  total  traffic  times  probability  of  successful 
transmissions. 


But  according  to  the  method  of  failure  described  in  Figure  3.7,  a 
successful  transmission  results  from  no  transmissions  in  interval  T, 
which  from  Equation  3.9  results  in 


Ps  =  P[0]  =  e'AT 


(3.24) 
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where  x  is  the  total  average  message  arrival  rate.  Using  the 
definition  of  total  traffic  as, 

G  =  XT 

then  (3.24)  results  in 


or  S  =  Ge  as  previously  obtained. 
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The  performance  of  a  slotted  AHOHA  (Equation  3 .23)  is  compared  to 
an  unslotted  (pure)  ALOHA  (Equation  3.16)  in  Figure  3.y.  A  maximum 
throughput  ot  0.368  is  achieved  at  a  traffic  of  C  =  1.  The  probability 
of  successful  transmission  remains  0.368  at  this  maximum  throughput 
condition. 

The  difficulty  of  the  slotted  ALOHA  system  in  terms  of  C  of  E 
applications  is  the  requirement  of  accurate  timing  devices  sufficiently  accurate 
to  maintain  sychronization  of  messages  and  slots.  This  added  expense  is 
significant;  the  system  is  also  more  apt  to  destabilize  and  will 
require  higher  maintenance. 


3.5.3.  Improvements  in  Unslotted  ALOHA  Performance 

NESS  and  C  of  E  have  essentially  selected  the  unslotted  ALOHA  system  as 
their  method  of  random  reporting.  In  fact,  for  bursty,  short  messages 

typical  of  environmental  applications,  the  unslotted  ALOHA, 
random  reporting,  system  approaches  or  surpasses  the  efficiency,  in 

terms  of  channel  utilization,  of  operational  protocols  requiring  more 
expensive  and  sophisticated  equipment.  This  is  acknowledged  by  all 
investigators  and  illustrated  in  Figure  3.10,  which  plots  channel 
efficiency  versus  message  length  for  various  protocols,  including  random 
reporting  with  various  success  probabilities. 

Many  valuable  results  have  been  reported  for  random  (unslotted 
ALOHA)  reporting.  Abramson  (II)  shows  how  total  channel  throughput  can 
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MESSAGE  LENGTH  IN  DITS  (DATA  ONLY) 

Figure  3.10:  Comparison  of  reporting  protocols  for  the  GOES  data  collection 
system  (6) 


significantly  be  reduced  (below  the  0.184  maximum)  if  messages  of  the 
various  DCF's  are  of  different  lengths.  In  this  report  message  lengths 
will  be  assumed  the  same  for  all  users.  Metzner  (16)  shows  tnat 
throughput  can  be  increased  (by  about  bU  percent)  by  grouping 
transmitters  into  high  and  low  power  sets.  The  concept  is  that  although 
collisions  may  occur,  high  power  transmissions  may  override  low  power 
ones  and  still  be  received  correctly.  We  will  continue  using  the  con¬ 
servative  view  that  all  DCP's  have  the  same  power,  and  collisions  lead 
to  complete  loss  of  data. 

Sant  (13;  studies  random  reporting  both  by  relaxing  the  assumption  of 
exponential  inter-arrival  times  and  by  letting  each  DCP  have  an  arbitrary 
distribution  or  message  transmission.  He  shows  that  when  the  traffic 
load,  ,  of  each  station  l  is  small,  relative  to  tne  total  traffic  G 
(the  common  case),  tne  throughput  becomes  tne  same  as  that  of  the 
unslotted  ALOHA  system  (Equation  3.16).  Furthermore  the  throughput  will 
always  be  bounded  by  the  unslotted  and  slotted  ALOHA  results  (between 
Equations  3.16  and  j.2 3).  In  summary  .the  Poisson  arrival  moael  is  very 
roDust  in  a  statistical  sense. 

For  the  Corps  and  most  environmental  data  users  the  main  concern  is 
the  low  reliability  of  message  reception  achieved  with  the  random 
reporting  scheme.  The  NESS  Users  Guide  for  Random  Reporting  (6) 
proposes  three  methods  to  increase  probability  of  successful 
transmissions  without  changing  protocols. 

1.  one  short  transmission  per  message:  so  as  to  achieve  high 
probability  of  success  for  a  single  transmission  (i.e., 

G  =  0.025  leads  to  P$  =  0.95). 
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c.  K  short  transmissions  per  message:  random  spacing  of  the 
identical  messages  to  raise  probability  that  at  least 
one  of  the  K  trials  was  successful.  This  effectively 
leads  to  increasing  transmission  rates  by  K  while  keeping 
message  duration  constant. 

3.  K  messages  in  one  long  transmission:  append  last  K-l 
messages  to  current  transmission,  leading  to  K 
opportunities  (trials)  to  successfully  receive 
the  data.  This  effectively  increases  message  duration  by 
K  and  keeps  transmission  rates  constant. 

The  first  method  can  be  analyzed  directly  using  Equations  (3.15)  and 
(3.16).  Methods  2  and  3  require  the  following  fairly  simple  extension. 

Interpret  transmission  as  an  independent  trial.  Define 

P  =  probability  [one  or  more  successes  in  K  trials] 

=  1  -  probability  Lno  successes  in  K  trials] 

Assuming  Bernoulli  trials 

P  =  1  -  (1  -  P$)K 
Substituting  from  Equation  (3.15): 

P  =  1  -  (1  -  e'26)K 


(3.25) 


(3.26) 
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This  equation  can  be  used  to  evaluate  the  effect  of  message  rep.  .ion 
on  channel  performance.  For  method  2  the  effective  total  traffic,  is 
6  =  N(T0  +  Tl)KA  where  A  is  the  average  transmission  rate  of  a  station 
in  the  network;  TQ  is  some  overhead  time  associated  with  each 
transmission;  and  is  the  duration  of  data  transmission.  For  method  3, 
G  =  N  7  (Tq  +  KT-j ) . 

Figure  3.11  shows,  assuming  a  target  reliability  for  one  or  more 
messages  of  P  =  0.95,  A  =  1/hour,  and  T  =  2,  4,  and  8  seconds,  the 
maximum  numbers  of  identical  platforms  that  can  share  a  channel  under 
various  repetition  regimes.  Note  that  Method  3  always  provides  a 
greater  number  of  platforms  for  a  given  level  of  reliability  than  Method 
2  or  Method  1,  the  poorest  of  the  three  alternatives.  Figure  3.12 
provides  similar  results  when  the  target  probability  of  success  is  0.99 

For  the  cases  shown,  up  until  roughly  K  =  3  to  5  repetitions,  the 
number  of  platforms  able  to  share  a  single  channel  increases 
dramatically.  Beyond  this  point,  the  increase  in  number  of  platforms 
able  to  share  a  channel  is  very  small.  For  large  K,  this  number  will 
actually  start  to  decrease  as  the  channel  becomes  heavily  loaded.  This 
gives  guidance  as  to  the  optimal  number  of  message  repetitions. 

If  efficiency  (numbers  of  platforms)  were  the  only  objective,  then 
Method  3  should  always  be  employed  for  the  given  example,  which 
approximates  GOES  hydrometeorological  system  user  characteristics. 
Nevertheless,  recall  that  another  criterion  is  the  timeliness  of  message 
receipt.  Under  Method  1,  or  cases  where  K  =  1,  there  is  no  delay 
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OVERALL  CHANNEL  LOADING.  G 

Figure  3. 120VERALL CHANNEL  LOADING  AS  A  FUNCTION  OF  TH 
NUMBER  OF  CHANNELS  FOR  X  »  1  HR.  P  =  0.99  ( 6 ) 


between  message  receipt  and  transmission.  Under  Method  2,  a  constant 
one-half  hour  delay  is  observed  (for  the  T  =  2  seconds  case).  Under 
Method  3,  the  average  delay  increases  as  a  function  of  the  number  of  repi- 
titions  as  (K  -  1)  x  1/2  nour.  Clearly,  if  timeliness  is  a  concern, 
the  user  faces  a  tradeoff  between  timeliness  and  efficiency. 

The  Users  Guide  tor  Random  Reporting  (b)  recommends  Method  2  based 
as  ottering  the  best  compromise  between  timeliness  and  efficiency,  and  we 
agree.  Furthermore,  that  document  correctly  recognizes  from  the  figures 
tnat  with  a  channel  loading  of  G  =  0.25,  three  and  five  repetitions 
virtually  guarantee  yb  and  99  percent  success  probabilities 
respectively.  Higher  loadings  and  repetitions  lead  to  diminishing  and 
insignificant  improvements  in  the  number  of  platforms  possible.  Using 
that  criterion  and  the  definition  of  G  leads  to 

G  =  0.25  =  NKlT 

or  the  maximum  number  of  stations  is: 

Nmax  =  (3.27) 


With  K  equal  to  3  and  5  repetitions,  with  N__v  or  fewer  stations,  success 

inciX 

probaDi nties  should  be  at  least  0.95  and  0.99, respectively. 
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Equation  3.27  is  a  good  criterion  and  will  play  a  major  role  in  the 
analysis  of  the  next  Chapter.  As  it  stands,  it  requires  that  \ ,  the  average 
station  reporting  rate,  is  a  known  fixed  quantity.  This  is  not  the  case  when 
DCPs  adapt  to  temporally  and  spatially  varying  hydrologic  events. 


Chapter  4 


Network  Design  Criteria  and  Analysis  Procedures 

4.1  Introduction 

Chapter  3  discussed  the  tools  and  techniques  that  have  been 
proposed  for  the  study  of  random  reporting  networks.  Chapter  2 
described  the  nature  of  Data  Collection  Platforms  and  their  use  in 
monitoring  environmental  data.  In  this  chapter  we  intend  to  establish 
guidelines  for  the  design  of  a  national  network  of  DCP's. 

It  was  obvious  in  Chapter  2  that  the  number  of  active  data 
collection  platforms  is  expected  to  increase  dramatically  over  coming 
years,  with  numbers  on  the  thousands  foreseen  in  the  near  future.  For 
the  Corps  of  Engineers,  NESS,  and  other  users,  this  presents  a  major 
planning  and  logistics  problem.  Some  of  the  issues  that  will  arise  are 

1.  Allocation  of  satellite  channels  (frequencies)  to  users. 

2.  Efficient  distribution  of  data  in  the  satellite-user 
leg  of  the  communication  system. 

3.  Reliability  of  hardware  and  communication  links. 

4.  Fair  distribution  of  resources  and  satellite  access. 

5.  Control  and  policing  of  users'  actions  that  may  lead  to 
system  failure  or  overload. 

Insure  equitable  and  reliable  message  reception  for 
various  users  while  maximizing  utilization  of  resources. 


6. 


This  work  really  addresses  point  number  6  above.  It  is  assumed 

that: 

1.  NESS  will  provide  a  limited  number  of  frequencies  to  be 
used  in  random  reporting,  limited  with  respect  to  the 
number  of  users. 

2.  The  communication  downlink  between  satellite  and  user  is 
not  a  problem. 

3.  Hardware  reliability  is  good. 

4.  Control  and  policing  activities  will  develop  and  conform 
to  the  assumptions  made  in  the  nation-wide  network  design 
algorithm.  This  last  point  is  particularly  important 
since,  as  was  discussed  in  Chapter  2,  the  options  on  DCP 
configurations  and  reporting  algorithms  are  practically 
infinite.  It  is  expected  that  C  of  E  will  have  to  maintain 
some  control  on  this  configuration  so  as  to  satisfy 
design  assumptions  as  closely  as  possible. 

Point  6  remains  a  problem  because  it  cannot  be  addressed  solely 
with  the  procedures  discussed  in  Chapter  3.  There  it  is  assumed  that 
all  DCPs  are  transmitting  at  a  known  rate.  In  fact,  the  reporting 
will  adapt  according  to  the  magnitude  of  environmental  excitation, 
it  would  be  possible  to  make  the  assumption  that  all  stations  respond  at 
the  maximum  possible  rate.  Given  the  fact  that  rainfall  and  runoff  (the 
<L  environmental  inputs  of  most  interest  to  C  of  E)  vary  widely  in  space  and 
time  over  the  continental  U.S.A.,  such  an  assumption  would  be  extremely 


conservative  and  an  expensive  proposition.  Notice  from  Figure  3.11  that 
assuming  a  modest  transmission  rate  of  1  per  hour,  3  repetitions,  2 
seconds  messages  and  the  recommended  loading  (G)  of  0.25  to  achieve  95% 
reliability,  the  maximum  number  of  stations  per  channel  would  be  about 
150,  a  number  not  much  above  the  needs  of  a  single  C  of  E  district. 

This  chapter  will  propose  a  methodology  to  explicitly  include  the 
variability  of  environmental  parameters  in  time  and  space  in  the 
decision  of  how  to  allocate  channels  to  users.  The  methodology  is  of 
descriptive  nature.  Given  a  set  of  parameters,  e.g.,  possible  reporting 
rates,  number  of  stations  per  user,  climatic  characteristics,  message 
duration,  etc.,  the  procedure  will  evaluate  the  level  of  performance  of 
the  system.  An  alternative  would  have  been  a  prescriptive  model  that 
would  configure  the  network  so  as  to  optimize  a  given  set  of  objectives. 
This  approach  was  not  taken  due  to  the  difficulties  in  quantifying  the 
possibly  many  and  conflicting  objectives  of  users  and  managers. 

Many  of  the  ultimate  design  decisions  should  remain  functions  of 
unquantifiable  policy  goals. 

The  methodology  is  not  a  simulation  model.  This  approach  would 
have  been  unjustified  given  present  uncertainty  on  number  and  possible 
locations  of  stations  and  given  the  nature  of  data  available. 
Computationally  it  would  have  become  an  unwieldy  exercise  that  could 
obscure  results  rather  than  illuminate  them. 


4.2  Streamflow  versus  Rainfall 


The  Army  Corps  of  tngineers  and  other  users  are  in  fact  mostly 
interested  in  river  discharge.  Data  Collection  Platforms  can  be  driven 
by  streamflow  or  rainfall.  Streamflow  is,  nevertheless,  a  very  local 
behavior.  The  river  basin  and  sub-basins  are  filters  that  transform 
high  frequency  rainfall  into  low  frequency  streamflow.  Although  it  may 
be  raining  over  a  large  area,  the  affected  river  basins  transform  this 
input  differently.  They  introduce  time  delays  and  storage  effects  so 
that  even  a  highly  correlated  spatial  input  will  result  in  uncorrelated 
hydrographs,  time  distribution  of  discharge  at  various  points  will  not 
coincide  at  all  within  or  among  oasins.  So  the  use  of  rainfall  as  the  focus  of 
the  methodology  tends  to  overestimate  the  response  rate  and  correlation  among 
platforms.  Rainfall  will  probably  require  high  sampling  rates 
concurrently  over  large  areas  to  an  extent  not  probable  with  streamflow. 

Due  to  the  local  nature  of  streamflow,  it  makes  no  sense  to 
generalize  behavior.  It  only  makes  sense  to  talk  about  response  where 
the  stations  are  located.  At  this  point  it  is  not  known  where  stations 
are  located  and  furthermore  it  would  be  an  unmanageable  task  to  look  at 
all  local  behavior  over  thousands  of  locations  even  if  the  sites  and 
data  were  available. 

Streamflow  behavior  is  also  so  dependent  on  antecedent  conditions 
that  generalizations  of  behavior  from  rainfall  analysis  seem  unwise. 

In  summary  for  this  project  rainfall  was  chosen  as  the  DCPs'  triggering 
phenomena.  It  is  a  conservative  assumption  in  terms  of  DCP  performance;  data  is 
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easily  generalized  due  to  its  homogeneous  characteristics  over  areas  and 
its  statistical  stationarity  over  the  time  scales  of  interest;  and  as 
will  be  seen,  reasonably  good  data  sources  are  available. 

Acknowledgeably ,  this  decision  will  introduce  several  problems  of 
operational  nature.  One  is  how  can  C  of  E  district  officers  "translate" 
reporting  rates  associated  with  discharges  to  rates  associated  with 
rainfall.  Second  is  the  decision  over  what  time  step  rainfall  is  going 
to  be  studied.  These  issues  will  be  discussed  in  Chapter  6  and  5 
respectively. 

4.3  Network  Design  Algorithm 

Several  assumptions  are  made  in  the  suggested  approach: 

1.  A  "user"  is  defined  as  a  Corps  of  Engineers  District  or  any 
other  non-overlapping  geographical  unit. 

Z.  All  users  emoloy  the  same  message  length  and  will  strive  for 
message  reliability  using  repetition  method  Z  as 
described  in  the  “Users  Guide  for  Random  Reporting"  (6)  and 
Chapter  3.  The  number  of  repetitions  is  taken  to  be  at  least 
3,  tne  same  for  all  users. 

3.  All  stations  witmn  a  user  have  the  same  adaptive  reporting 
rate  algorithm.  This  implies  the  same  "threshold  and  slope" 
parameters.  Different  users  can  have  different  parameters. 

4.  Trading  of  unit  loads  between  users  will  not  be  allowed. 


5. 


Each  user  will  attempt  to  use  up  to  their  available  unit 
loads.  A  unit  load  is  presently  denned  by  NESS  as  120  sec. 
per  day  per  station  or  5  sec.  per  hour  per  station.  This  is 
considered  a  parameter  in  the  technique  in  that  it  can  be  varied. 

6.  Each  user  will  design  its  network  for  a  "worst"  local 

condition.  The  meaning  ot  “worst"  will  be  expanded  later;  it 
refers  to  possible  combinations  of  reporting  rates. 

The  approach  taken  identities  two  constraints  in  DCH  deployment  and 

use. 

1.  NESS  limits  each  user  by  the  unit  load  concept.  Therefore, 
each  user  cannot  exceed  its  quota. 

2.  Message  reliability  has  to  be  reasonably  high  for  all  users. 

The  reliability  of  each  user  is  very  much  dependent  on  other 
users,  even  though  each  user  operating  alone  would  not  run 
into  a  reliability  problem. 

With  the  above  in  mind  the  network  design  is  taken  in  two  steps: 
one  local  and  one  national.  At  the  locel  level  each  user  configures  its 
network  by  selecting  the  criteria  of  maximizing  its  information,  i.e.,  using 
as  much  of  the  unit  loads  available  to  him.  At  this  level  reliability 
will  rarely  be  a  problem,  but  it  will  be  checked.  At  the  national 
level,  all  users,  after  their  local  design,  must  satisfy  a  given  level 
of  message  reliability.  If  the  national  network  fails  to  satisfy  this 
reliability  level,  the  design  reverts  to  the  local  level  where 
adjustments  in  number  ot  stations  and/or  reporting  rates  must  be  made. 

How  to  allocate  these  adjustments  will  be  further  discussed. 


4.3.1.  Procedural  Details 


The  underlying  difficulty  in  the  above  outlined  design  philosophy 
is  that  the  DCP's  are  activated  at  various  reporting  rates  by  a  random 
climate.  Therefore,  the  local  and  national  constraints  can  be 
nationally  satisfied  only  at  a  given  level  of  probability. 

The  unit  load  limitation  was  discussed  in  Chapter  3  and  is 
presently  established  by  NESS.  Reliability  will  be  tested  using  the 
results  given  in  the  "Users  Guide  for  Random  Reporting"  and  Chapter  3. 
Based  on  good  analysis  and  rationale,  it  was  there  concluded  that  95% 
reliability  could  be  achieved  by  using  repetition  Method  2  with  3 
repetitions  and  a  channel  loading  of  0.25.  Channel  loading  is  defined 
by 


G  =  N  K  A  T  (4.1) 

where  A  is  the  avpraoe  station  reporting  rate  of  the  system  in  messages 
per  second,  T  is  the  message  duration  in  seconds,  K  is  the  number  of 
repetitions  and  N  is  the  number  of  stations.  A  reliability  of  99%  could 
be  achieved  by  repeating  5  times  and  having  the  same  channel  loading. 
With  the  above,  it  followed  that  for  a  known  average  station  reporting 
rate,  A  (transmissions  per  second),  the  maximum  number  of  stations 
possible  (and  still  maintain  reliability)  is, 

"™*  *  (4  K  1  T>''  <4-2> 
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Notice,  though,  that  due  to  the  random  climate,  I  i<;  a  random 
variable.  Therefore,  unless  A  is  taken  at  tin.  extremely  conservolive 
maximum  possible  value,  there  is  a  finite  probability  of  having 
(4  K  A  T)  ^  >  Nnax  which  will  lead  tc  I cv.t  r  reliability.  Tf:t 

probability  of  this  occurrence  is  a  design  criterion. 


4. 3. 1.1  Local  Design 

A  user  will  proceed  as  follows: 

1.  A  number  of  desired  stations,  N  ,  is  fixed,  based  on  need, 

J 

tradition,  etc.  Subscript  j  indicates  the  user. 

2.  User  j  unit  loads  are  given  by 

K’A^  •  3600fser/hr)*l(sec)/5(se-c/hr7t.l .  ) 

is  the  number  of  messages  per  second  at  station 
j.  There  are  b  seconds  per  hour  per  unit  ioao. 
is  reformulated  in  light  of  the  randomness  of 


3600  '  T/5  <  1  with  probability  P  (4.3) 

where  A. is  a  random  variable  representing  the  average 

J 

transmission  rate  of  user-  j  stations  and  is  g i ver.  by: 


nj 

.i, 


where  A ■  • 

1  J 

i  of  user 
The  above 


Aij  as: 


K”  A 
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(4.4) 


ii 

j 

J  Nj  i=l 

A,  is  a  ranaom  variable  because  each  station  response  rate, 
A..,  is  unknown  and  dependent  on  the  precipitation  input. 

'  \J 

For  reasons  that  will  be  apparent  in  Chapter  5,  when  the 
available  climate  information  is  discussed,  the  adaptive 
reporting  algorithm  will  be  modified  so  that  A- ■  can  take  one 

*  J 

of  n  values,  depending  on  the  rainfal 1  accumulation  of  a 
rainfall  event.  For  the  sake  of  clarity,  take  n=8.  So, 


Ai j  =  tblj’b2j’b3j’b4j,b5j’b6j’b7j’b8j} 


(4.5) 


The  eight  rates  could  depend  (for  example)  on  the  following  8 
rainfall  conditions  in  any  one  station,  respectively:  1)  no 
rainfall  (or  <  0.01  inches),  2)  rainfall  between  0.01  and  C.5 
inches,  3)  rainfall  between  0.5  and  1.0  inches,  4)  rainfall 
between  1.0  and  l,25inches,  5)  rainfall  between  1.25and  1.50 
inches,  6)  rainfall  between  1.5  and  2.0  inches,  7)  rainfall 
between  2.0  and  3.0  inches,  and  8)  rainfall  greater  than  3.0 
inches. 
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Equation  4.4  is  a  sum  of  random  variables  with 

distribution  that  will  tend  to  normality  reasonably  fast  as  N. 

J 

increases,  given  that  the  distribution  of  total  rainfall  in 
each  location  is  generally  accepted  to  be  nearly  gamma 
distributed  (31).  Assuming  normality  the  parameters 
required  to  fully  define  the  distribution  of  X.  are  then  its 

J 

mean  and  variance. 


The  mean  of  A  ■  i«  niven  by: 

J  J 


X  * 


(4.6) 


wherp  A.,  is  the  mean  reporting  rate  at  each  station  of  user 

*  J 

j.  This  mean  is  given  by 


A i j  =  b^.P  [no  rainfall  in  i] 

8 

+  l  b  •  .  P  [rainfall  in  interval  i  and 
1=2  it  rains  in  ij 

(4./) 


The  notation  P[.j  implies  the  probability  of  the  event 
described  within  brackets. 

The  variance  of  A.  is, 

J 


84 


Var[A.'|  =  ( )2£  l  Var(x. .)  + 

J  11  j  i  =1  'J 


i=l 


l  cov(i\,i2)] 

’z 

(4.8) 


In  the  above  the  variance  of  A^.  is  given  by 

A 

VarU,.]  -  (bfj  -  xu  l2  P  [no  rainfall  in  i] 

8  *  2 

+  I  (b.  i  *  xui  P  [rainfall  in  interval  i  and 
1=2  J  it  rains  in  ij 


(4.9) 


The  covariance  between  point  i^  and  i2  of  user  j,  cov(i^,i2) 
is  given  by: 


cov(iri2)  =  E  [(A< 


V 


■  -  *■ 


i2J 


)] 


^i1  j^bl j  "  Ai2j)  * 


[no  rain  in  i,  and  no 
rain  in  i,,] 


+ 


+ 


+ 


V(b«j 


P  [rainfall  in  interval 
l  and  it  rains  in  i2 
and  not  in  i^] 


X  <b»j  -  v<b'j  ■  ■ p 


[rainfall  in  interval 
J l  and  it  rains  in  i, 
and  not  in  i^] 


8 

I 

*r2 


(b 


A.  .)  .  P  [rainfall  in 
'2J  interval 

and  it  rains  in  1, 
and  rainfall  in 
internal  ju  and 
it  rains  in  i2] 


(4.10) 


where  E  means  statistical  expectation. 


Reasonably  assuming  that  the  rainfall  amounts  are  independent 


and  only  the  occurrence  (or  not)  of  rainfall  anywhere  in 
space  is  dependent,  some  of  the  probabilities  in  (4.10) 
simplify: 


P[rainfall  in  interval  I  and  it  rains  in  ^  one  rot  in  i 2 3 

=  P[rain  in  interval  £| rainfall  in  [rainfall  in 

and  net  in  igj  ! 


P[rainfall  in  interval  and  it  rains  in  i^  and  rainfall  ;n 
interval  1 ^  and  it  rains  in  i £3 

P[rainfall  in  interval  £,|rains  in  i-.'i.P[ rainfall  in  interval 
=  £^|rain  in  i^]  .  P[rainfail  in  i^  and  in  i^]. 


The  notation  P[A|B]  signifies  probability  of  event  A  given 
that  event  B  occurs. 

3.  With  the  mean  and  variance  cf  7.  now  defined,  thp  user  can 

J 

draw  its  distribution  as  in  Figure  4.i. 
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Figure  4.1:  Probabilistic  Distribution  of  A. 

J 


The  user  solves  the  unit  load  equality  implied  by  Eq.  4.3, 


T.  _  5 

Aj  '  360C  T  K 


(4.11) 


and  plots  it  in  Figure  4.1.  The  area  under  the  curve  up  to 

x'-  is  the  probability  that  a  reporting  rate  less  than  or  equal 
0 

to  a',  is  observed.  If  this  area  is  greater  or  equal  to  P 
j  o 

then  the  user  desired  N.  and  reporting  rates  (Equation  4.5) 

J 

are  acceptable.  If  not,  N.  must  be  reduced  or  the  possible 

J 

reporting  rates  reduced  until  the  criterion  is  satisfied. 


4.  Once  the  unit  load  inequality  of  Eq.  4.3  is  satisfied  at  a  given 
probability  level,  the  user  may  use  Equation  4.2  with  his 
chosen  N,  to  obtain 

J 


A“ 

J 


_ 1 _ 

4  N.  T  K 

J 


(4.12) 


If  A'!  >  A},  then  the  user  is  satisfying  unit  loads 

limitation  with  message  success  probabilities  of  95%  or  more 
with  probability  P  .  If  X'!  <  Al  then  the  cross  hatched  area 

^  J  J 

in  Figure  4.1  gives  the  probability  (less  than  PQ)  that 
successful  messages  be  received  95%  of  the  time.  If  this 
probability  is  unacceptable,  the  user's  only  choice  is 
to  further  reduce  N.  or  the  report..;  rates. 

J 


4. 3. 1.2  National  Design 

At  the  national  level,  the  premise  is  that  unit  load  restrictions 
are  satisfied  and  the  issue  is  to  achieve  a  given  level  of  message 
success  at  a  given  probability  level.  Using  Equation  4.2  on  a  national 
scale: 


A*  =  1.0/4  N  T  K 


(4.13) 


where  N  =  l  N.  and  jeJ  means  that  j  is  an  element  in  the  set  J  of  DCP's  using 
jeJ  J 
a  common  channel . 

The  rate  X*  is  the  average  station  rate  that  would  be  required  to 
achieve  95%  message  success  probability  in  a  system  with  deterministic 
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response  rates.  Given  the  variable  national  climate,  in  fact  the 
average  national  reporting  rate  per  station, 


*■5  .1, 

jej 


Ni 

j  .  I  i  L  j 
»  ■  jL  Nj  1^1 


Aij 


(4.14) 


is  a  random  variable,  again  with  distribution  approaching  normality, 
m  is  the  number  of  users  in  set  J. 

The  mean  of  X  is, 

A  A 

«■  ;  [  A .  (4.15) 


where  a,  was  given  by  Equation  4.6. 
J 

The  variance  of  X,  var(X),  is 


Var  [a]  -  ^  l  var  fL] 

m  jeJ  J 


T  I 
- 


m 


l 

J2eJ 


1 


J1  J2  1 1 eN j 


h  * 


1 


I  cov(i.  i?) 

Vj,  * 


(4.16) 


where  the  terms  have  been  previously  defined.  Having 
A  and  var[T|,  the  distribution  of  X  can  be  drawn  as  in 
Figure  4.2. 
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Figure  4.2:  Distribution  of  National  Reporting 
Rate  T 


If  A*  as  given  by  4.13  plots  in  Figure  4.2  such  that  the  area  below 
it  is  greater  or  equal  than  to  a  pre-specified  level  ,  then  the 
National  system  satisfies  all  criteria:  all  users  satisfy  unit  load 
restrictions  with  probability  PQ  and  national  system  achieves  955T  (or 
whatever  chosen  level)  success  rate  with  probability  P, .  If  the  area 
under  A*  is  less  than  P-j,  then  the  alternatives  ere: 
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1)  try  a  oitferent  combination  of  users  in  the  channel,  a 
a  different  set  J. 

2)  require  users  to  reduce  the  reporting  rate. 

3)  and/or  reduce  the  total  number  of  stations  in  the 
national  network  by  requiring  users  to  reduce  their 
number  of  stations. 

There  is  a  political  and  equity  issue  unsolved  in  going  back  to 
users  to  force  redesign,  if  it  comes  to  that.  Within  the  proposed 
scheme,  equity  would  be  defined  by  making  the  distribution  of  A-  (Figure 

J 

4.1)  look  the  same  for  all  users  j.  Clearly,  though,  this  objective  is 
flawed  since  it  could  possibly  force  some  users  to  increase  rates  and 
numbers  of  stations.  Therefore,  downward  adjustments  should 

A 

start  with  1)  users  with  large  A.  and  large  variance,  var[A,J;  2)  users 

J  J 

with  small  areas  below  A^j  in  Figure  4.1;  3)  users  approaching  the  PQ 
criterion  in  satisfaction  of  unit  load  requirements. 
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Chapter  5 

Climatic  Data  Analysis 

5.1  Required  Climatic  Analysis 

To  perform  the  analysis  presented  in  Chapter  4,  we  require  the 
definition  of: 

1 )  P  |_no  rainfall  in  i] 

2)  P  [rainfall  in  depth  interval  i  and  rainfall  in  i] 

3)  P  [no  rain  in  i^  and  no  rain  in  i^] 

4)  P  [rainfall  in  i,  and  not  in  i,J 

5)  P  [rainfall  in  i2  and  not  in  i^j 

6)  P  [rainfall  in  i1  and  in  i\>J 

Inherent  in  the  formulation  and  in  the  above  required  probabilities 
are  several  assumptions: 

1)  we  have  information  on  storm  accumulation; 

2)  the  above  probabilities  exist  for  known  locations 
of  DCP's. 

Both  of  the  above  are  not  true,  so  approximations  are  required. 

For  the  sake  of  feasibility,  the  daily  rainfall  accumulations  are  taken 
as  surrogates  of  storm  depth.  Clearly  this  is  in  error  if  multiple 
storms  occur  in  a  day  and/or  if  storms  last  longer  than  a  day.  We  feel 
that  most  important  occurrences  will  be  well  represented  within  this 


.1 

.1 


i 


1 


j 


1 


24-hour  period.  The  choice  will  create  operational  problems  in  making 
streamflow-rainfall  comparisons,  necessary  to  rationally  obtain 
reasonable  and  valid  reporting  rates,  b^,  needed  for  analysis.  This 
will  be  addressed  in  Chapter  6.  On  the  other  hand,  a  24-hour  accumulation 
of  rainfall  will  lead  to  higher  correlations  in  space,  a  conservative  assump¬ 
tion  that  will  tend  to  limit  allowable  numbers  of  DCP's  in  a  channel. 

The  assumption  that  potential  locations  of  DCP's  are  known  and  that 
the  necessary  data  or  probabilities  exist  is  untenable.  At  best,  the  necessary 
data  will  be  available  (as  will  soon  be  presented)  over  a  reasonably  dense  grid 
covering  the  conterminous  United  States.  The  agreement  of  grid  points  with 
a  future  or  existing  DCP  location  would  be  coincidental.  Given  the  above  situation, 
it  will  be  assumed  that  available  data  (at  points)  are  representative  of  a  homo¬ 
geneous  climate  over  its  "area  of  influence."  Its  area  of  influence  will  be 
defined  by  its  corresponding  Thiessen  polygon.  It  will  then  be  assumed  that 
DCP's  within  a  polygon  have  the  probabilistic  and  climatic  characteristics  of 
the  corresponding  data  point.  Since  the  locations  of  potential  DCP's  within  users, 

C  of  E  Districts,  are  not  known,  it  will  be  further  assumed  that  they  will  be 
uniformly  distributed  within  each  District.  For  example  (see  Figure  5.1),  de¬ 
fine  N.  as  the  number  of  stations  in  District  j,  in  the  Figure  there  are  2 
districts  separated  by  a  sinuous  boundary;  define  a^  as  the  sub-area  of  the 
Thiessen  polygon  i  (i.e.,  data  point  i)  within  district  j;  and  A. 

si 

the  area  of  district  j.  Then,  the  number  of  stations  in  district  j, 
responding  to  the  climate  of  data  point  i  is. 


’  1 
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(5.1) 


Say  that  in  Figure  b.l,  if  the  area  of  district  2,  A2,  were  100,  with 
a12  =  70,  a32  =  25,  and  a4.7  =  5,  and  if  100  UCP  were  to  be  allocated 
for  District  2,  then  there  would  be 


N 


„  70 
12  "  W 


100  =  70 


DCP's  responding  according  to  the  climate  of  point  1,  and 


i  -  25 

32  “  TOff 


100  =  25 


responding  according  to  point  3  and  similarly  5  (or  N42)  responding  according 
to  point  4. 


5.2  Climatic  Data  Sources 

A  search  of  literature  and  data  sources  yielded  several  good  leads 
in  the  statistics  of  rainfall  over  the  U.S.A.  A  brief  summary  of  these 
sources  follows: 

1.  Klein  [32]  gives  a  complete  study  of  tracks  (primary  and 

secondary)  as  well  as  the  frequency  of  genesis  of  low  pressure 
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centers  (cyclones)  over  the  Northern  Hemisphere.  This 
information  is  peripheral,  but  it  could  provide  qualitative 
information  as  to  where  to  expect  correlation  between  the 
occurrence  of  storm  events. 

Jorgensen  and  Jorgensen,  et  al ,  [.30,  31]  use  a  data  set 
with  108  stations  over  the  conterminous  U.S.A.  and  15  years  of 
record  to: 

a)  give  the  probability  of  rainfall  in  any  of  the  108 
stations,  on  a  monthly  and  seasonal  basis  for  any  of 
7  different  time  periods  in  a  day  (0-6,  b-12,  12-18, 
18-24,  0-12,  12-24,  0-24). 

b)  give  the  probability  of  rainfall  depths  in  7 
intervals  (0.01-0.1,  U. 1-0. 25,  0.25-0.50,  0. bU-1.00, 
1.00-1.5,  1. 5-2.0,  2.U  or  greater)  conditional  on 
the  occurrence  of  rainfall  in  any  of  the  108 
stations  for  the  7  different  time  periods  described 
above  for  four  seasons. 

c)  give  the  number  of  "wet"  and  "dry"  periods  per 
station,  per  season. 

d)  hypothesize  that  climatic  behavior  over  tne  u.S.A. 
can  be  divided  in 

—  east  and  west  of  the  Kockies 
--  and  according  to  average  rainfall 
accumulation  per  wet  period. 

e)  hypothesize  that  rainfall  depths  are  gamma 


distributed. 


The  original  data  set  of  Jorgensen  is  apparently  lost  and 
efforts  to  obtain  it  failed. 

3.  A  very  complete  data  set  was  located  at  the  University  of 
Illinois.  After  extensive  discussions  the  following  has  been 
concluded: 

a)  Ihe  full  data  set  includes  51  tapes.  Processing  of 
desired  stations,  including  some  accounting  for  data 
errors  is  infeasible  due  to  budget  limitations. 

b)  A  data  suoset  of  about  200  stations  exists. 

Unfortunately,  this  is  storm  data.  The  actual  time  of 
storm  occurrence  is  not  preserved  in  the  subset,  uur 
analysis  is  impossiDie  without  this  information. 

4.  Some  recent  literature  (34)  argues  that  tne  distribution  of 
storm  areas  may  be  obtained  from  point  (station)  information. 
This  applies  to  nomogeneous  climatic  regions.  In  order  to  use 
this  concept,  we  must  make  subjective  assessment  of  these 
regions.  This  is  possiDie,  but  considered  a  secondary 
approach  to  obtain  the  information  we  needed. 

5.  The  N.W.s.  lechniques  Development  Laboratory  uses  a  data  set 
containing  about  10  years  of  data,  in  an  hourly  basis,  for 
about  2bU  stations  throughout  the  u.S.A.  The  Techniques 
Development  Laboratory  cooperated  in  selling  us  this 
information.  The  nature  of  these  data  will  be  better  defined 
in  the  next  subsection.  It  is  important  to  state  that  this 
data  set  is  the  one  used  to  calibrate  weather  forecasting 
models  used  by  the  N.W.S. 
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It  is  important  to  note  that  Jorgensen,  et  al ,  (31)  provide  us 
with  the  first  2  probabilities  needed  in  this  analysis.  As  Section  5.3 
will  discuss,  it  is  possible  to  approximate  al^  the  necessary 
probabilistic  information  from  their  analysis.  A  more  complete  approach 
uses  TDL's  raw  data.  This  alternative,  although  considerably  more 
computationally  burdensome,  was  selected.  It  is  discussed  in  Section 


5.2.1  The  Techniques  Development  Laboratory  (TDL)  Data  Set 


The  National  Weather  Service's  Techniques  Development  Laboratory 
compiles  data  for  255  stations:  236  over  the  conterminous  U.S.A.,  14  in 
Alaska,  4  in  Hawaii,  and  1  in  Puerto  Rico.  The  data  includes  several 
parameters,  viz.,  precipitation,  temperature,  dew  point  temperature, 
wind,  etc.,  and  is  stored  in  3  hour  intervals.  These  data  are  utilized  to 
develop  meteorological  predictors  which  form  part  of  the  Multiple  Output 
Statistics  (MOS)  system  (33). 

The  Techniques  Development  Laboratory  agreed  to  sell  a  portion  of 
this  data  set.  Analyzed  were  daily  rainfall  accumulations  of  the  236 
stations  over  the  conterminous  U.S.A.  The  period  of  record  available 
consists  of  9  years  from  October  1,  1972  through  September  30,  1981. 

An  alphabetical  listing  of  all  stations  is  given  in  Table  5.1. 
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e  5.1:  Names  of  Stations  in  Techniques  Development 
Laboratory  Data  Set  (33) 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 
11 
12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 

38 

39 

40 

41 

42 

43 

44 

45 


STATION 


ID  NUMBER 


3103FLAGSTAFF  *  ARJ7. 
3012ASHEVILLE  *  NC 
381 3MAC0N  »  GA 
3820AUGIJST  A  *  GA 
3822SAVANNAH *  GA 
3856HUNTSVIIJ.  E  *  ALA 
3860HUNT INGTON *  W  VA 
3870GREENVILLE*  SC 
3872BECKLE Y *  M  VA 
3927F0RT  WORTH *  TEX 
3928WI CH I T  A  *  KANB 
3937I.AKF.  CHARLES *  LA 
3940 JACKSON »  MISS 
3945C0LUMBIA *  MO 
3947KANSAS  CITY*  MO 
4 725 BINGHAMTON *  NY 
4751  BRADFORD  *  PA 
1 1641  SAN  JUAN »  P.R. 
12834DAYT0NA  BEACH ►  FLA 
12835F0RT  MYERS  *  FLA 
12836KEY  WEST  *  FLA 


1 

2 

3 

4 

5 

6 

7 

8 
o 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 


1 2839MIAMI »  FLA  22 
128410Rl.ANn0,  FLA  23 
12842TAMPA*  FLA  24 
1 2844WEST  PALM  BEACH*  FLA  25 
1 2884B00THVILLE  *  LA  26 
1 29 12V I C TOR  I A  *  TEX  27 
12916NEW  ORLEANS*  LA  28 
12919BR0WNSVILI.E  *  TEX  29 
12921SAN  ANTONIO*  TFX  30 
12924C0RPUS  CHRISTI*  TEX  31 
1 2960H0UST0N »  TEX  32 
13722RALFIGH-BURHAM*  NC  33 
13723GREENSB0R0*  NC  34 
1 3729EI.K INS  *  W  VA  35 
1 373 3LYNCH BURG*  VA  36 
1 3737N0RF0LK  *  VA  37 
13739PHII.ABELPHI  A  *  PA  38 
13740RICHM0NB*  VA  39 
13741  ROANOKE*  VA  40 
13743WASHINGT0N*  BC  41 
1374  8W II.  MING  TON*  NC  42 
13781  WILMINGTON*  BEL  43 
13B65MERIBI AN*  MISS  44 


13066CHARI.EST0N  *  W  VA  45 


Table  5.1:  Cont 


STATION  ID 

NUMBER 

46 

1 3873ATHENS  *  GA 

46 

47 

13074ATI.ANTA.  BA 

47 

48 

130 76 BIRMINGHAM*  At.  A 

48 

49 

1 3877BRIST0L  »  TENN 

49 

50 

13080CHARLEST0N  •  SC 

50 

51 

13881 CHARLOTTE *  NC 

51 

52 

1 3882CHATTAN00GA *  TENN 

52 

53 

13883C0LUMB I A  *  SC 

53 

54 

1 3889 JACKSONVILLE  *  FLA 

54 

55 

1 3891KN0XVILLE »  TENN 

55 

56 

13893MEMPHIS*  TENN 

56 

57 

13894M0BILE *  ALA 

57 

58 

1 389SMONTGOMERY »  ALA 

58 

59 

13B97NASHVILLE*  TENN 

59 

60 

1 3899PENSAC0LA *  FLA 

60 

61 

1 3935 ALE X ANDRIA  *  LA 

61 

62 

1 3957SHREVEP0RT  *  LA 

62 

• 

63 

13958AUSTIN*  TEX 

63 

64 

1 3959WAC0*  TEX 

64 

65 

1 3960DALLAS *  TEX 

65 

66 

13962ABILENE*  TEX 

66 

67 

13963I.ITTLE  ROCK*  ARK 

67 

*  - 

68 

1 3964F0RT  SMITH*  ARK 

68 

69 

13966WICHITA  FALLS*  TEX 

69 

'J 

70 

1 39670KI. AHOMA  CITY*  OKI.A 

70 

;  \ 

71 

1 3968TULSA  *  OKI. A 

71 

‘ 

72 

13970BAT0N  ROUGE*  LA 

72 

73 

1 3984C0NC0RDI A  *  KANS 

73 

74 

13985B0BGE  CITY*  KANS 

74 

* 

75 

13993ST  JOSEPH*  MO 

75 

76 

13994ST  LOUIS*  MO 

76 

77 

13995SPRINGFIEL.D  *  MO 

77 

-  ■ 

78 

13996T0PEKA  *  KANS 

78 

• " 

79 

1 4606BANG0R  *  ME 

79 

80 

14607CARI BOU*  ME 

80 

81 

14732NEW  YORK*  NY 

81 

82 

14733BUFFAL0*  NY 

82 

' 

83 

14734NEWARK  *  NJ 

83 

. 

84 

14735ALBANY*  NY 

84 

•  ■ 

85 

14 737 ALLENTOWN*  PA 

85 

-  -  - 

86 

14739B0ST0N*  MASS 

86 

87 

14740HARTF0RJJ*  CONN 

87 

« 

88 

14742BURLINGT0N*  VT 

88 

i 

< 

89 

14745C0NC0RD*  NH 

89 

< 

90 

14751  HARRISBURG*  PA 

90 

• : 

100 


Table  5.1 :  Cont 


STATION  ID  NUMBER 

91  14764P0RTI.AND,  ME  91 

92  14765PR0VIDENCE ,  RI  92 

93  14768R0CHESTER »  NY  93 

94  1 4771  SYRACUSE ,  NY  94 

95  14777WB  SCRANTON  »  PA  95 

96  1 4778WILL.  IAMSPORT »  PA  96 

97  14819CH1CAS0  MIDWAY,  ILL  97 

98  14820CI.  FUEL  AND *  OHIO  98 

99  1 482 1 COLUMBUS ,  OHIO  99 

100  14826FI.  JNT  ,  MICH  100 

101  14827F0RT  WAYNE »  IND  101 

102  14836LANSING*  MICH  102 

103  14837MADIS0N»  WIS  103 

104  14839MII.WAUKEE »  WIS  104 

105  1 4840MUSKE60N ,  MICH  105 

106  14842PE0RIA,  ILL  106 

107  14847SAULT  ST  MARIE,  MICH107 

108  1 4848S0UTH  BEND*  IND  108 

109  14850TRAVERSE  CITY,  MICH  109 

110  14 85? YOUNGSTOWN ,  OHIO  110 

111  1 4860ERIE ,  PA  111 

112  14895AKR0N-CANT0N ,  OHIO  112 

113  14898GREEN  BAY,  WIS  113 

114  1 49 1 3DUI.UTH ,  MINN  114 

115  14914FARG0,  N  DAK  115 

116  14918INTL  FALLS,  MINN  116 

117  1 4920I..ACR0SSE ,  WIS  117 

118  1 4922MTNNEAP0L IS,  MINN  118 

119  14923M0LINE ,  ILL  119 

120  1 4925R0CHESTER ,  MINN  120 

121  14929ABERDEEN,  S  DAK  121 

122  14 931 BURLINGTON,  IOWA  122 

123  1 4933DES  MOINFS,  IOWA  123 

124  14935GRAND  ISLAND,  NEBR  124 

125  14936HUR0N ,  S  DAK  125 

126  1 4940MAS0N  CITY,  IOWA  126 

127  1 49420MAHA ,  NEBR  127 

128  14943SI01JX  CITY,  IOWA  128 

129  1 4944SIQUX  FALLS,  S  DAK  129 

130  14991EAU  CLAIRE,  WIS  130 

131  ?1 504HIL0 ,  HI  131 

132  22010DFI.  RIO,  TEX  132 

133  22516KAHULUI ,  HI  133 

134  22521H0N0LUI.U,  HI  134 

135  22536L. IHUE ,  HI  135 


•  ' 


»  I 
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Table  5.1 :  Cont 


STATION  ID  NUMBER 

136  23023MIDLAND.  TF.X  136 

137  23034SAN  ANGELO.  TEX  137 

138  23042LUBBQCK »  TEX  138 

139  23044EI.  PASO.  TEX  139 

140  23047AMARILL0 »  TEX  140 

141  23048TUCUMCAR I »  N  HEX  141 

142  23050ALBUPUFRPUE »  N  HEX  142 

143  23062DENVER.  COLO  143 

144  23065G00DL AND.  KANS  144 

145  23066GRAND  JUNCTION.  C0L0145 

146  23090FARMINGT0N.  N  HEX  146 

147  231291. ONG  BEACH.  CALIF  147 

148  23153T0N0PAH.  NEV  148 

149  23154ELY.  NEV  149 

150  23 155 BAKERSFIELD.  CALIF  150 

151  23159BRYCE  CANYON.  UTAH  151 

152  23160TUCS0N,  ARI7.  152 

153  23161DAG6ETT.  CALIF  153 

154  23169LAS  VEGAS .  NEV  154 

155  23174L0S  ANGELES.  CALIF  155 

156  23183PH0EWIX.  ARIZ  156 

157  23185REN0.  NEV  157 

158  23188SAN  DIEGO.  CALIF  158 

159  231 94WINSL.0W .  ARIZ  159 

160  23I95YUMA.  ARIZ  160 

161  23 2300 AKL AND .  CALIF  161 

162  23232SACRAMENT0.  CALIF  162 

163  23234SAN  FRANCISCO.  CALIF163 

164  23237ST0CKT0N »  CALIF  164 

165  23273SANTA  MARIA.  CALIF  165 

166  240X1  BISMARCK .  N  DAK  166 

167  24013MIN0T.  N  DAK  167 

168  24018CHEYENNE »  WYO  168 

169  24021LANDER.  WYO  169 

170  24023N0RTH  PLATTE.  NFBR  170 

171  24025PIERRE.  S  DAK  171 

172  24027R0CK  SPRINGS.  WYO  172 

173  24028SC0TTSBUJFF.  NEBR  173 

174  24029SHERIDAN.  WYO  174 

175  24033BILLINGS.  MONT  175 

176  24089CASPER.  WYO  176 

177  24090RAPID  CITY.  S  DAK  177 

178  24 121  El.  K0»  NEV  178 


Table  5.1 :  Cont. 


station  ID  number 

179  24127SALT  I.AKE  CITY,  UTAH  1 79 

180  24128WINNEMUCCA  ,  NFV  180 

181  24131B0ISE*  IDAHO  181 

182  24 134 BURNS*  ORES  182 

183  24 1 43GREAT  FAl  l.S»  MONT  183 

184  24 1 44HFLFNA  *  MONT  184 

185  24 14 AKAL I  SPELL  ,  MONT  185 

186  241 53MISS0ULA  *  MONT  186 

187  24155PENDLFT0N  *  OREG  187 

188  24156POCATF.LLO*  IDAHO  188 

189  24 1 57SP0KANF.  *  WASH  189 

190  241 72L0UELQCK  *  NEV  190 

191  24 193WEND0VER ,  UTAH  191 

192  2421ARED  BLUFF*  CAL T F  192 

193  24221EUGENE ,  ORFB  193 

194  24225MEDF0RD  *  OREG  1Y4 

195  242270L.YMPIA*  WASH  195 

19A  24229P0RTL AND *  OREG  196 

197  24230REDM0ND  *  OREG  197 

198  24232SALEM*  OREG  198 

199  24233SEATTI.F.-TAC0MA »  WASH  1 99 

200  24243YAKIMA*  WASH  200 

201  24283ARCATA  *  CALIF  201 

202  24284N0RTH  BEND *  OREG  202 

203  25308ANNETTE *  AK  203 

204  25309 JUNEAU*  AK  204 

205  25339YAKUTAT  *  AK  205 

206  25503KING  SALMON*  AK  206 

207  25A24C0LD  BAY*  AK  207 

208  25713ST.  PAUL  ISLAND*  AK  208 

209  26411  FAIRBANKS*  AK  209 

210  2A451 ANCHORAGE  *  AK  210 

211  26510MC6RATH  *  AK  211 

212  26A15BETHEL*  AK  212 

213  26616K0TZE8UF*  AK  213 

214  2661 7N0ME »  AK  214 

215  27401BARTER  ISLAND*  AK  215 

216  27502BARR0W *  AK  216 

217  930  37C  01..  OR  ADO  SPOS*  COLO  217 

218  93044ZIIN I  *  N  MEX  218 

219  93045TRUTH  OR  CONS*  N  MEX219 

220  93058PUEBL  0*  COl  O  220 

221  93 1 29CFDAR  CITY*  UTAH  221 

222  93 1 93FRFSN0  *  CALIF  222 
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Table  5.1 :  Cont. 


STATION 


ID  NUMBER 


223  93721 BALT  I MORE  r  MB  223 

224  93729CAPE  HATTERAS  t  NC  224 

225  93730ATI..ANTIC  01  TY »  NJ  225 

226  93738WA3H--BIILLES »  VA  226 

227  93739WAI.I.0PS  ISLAND*  VA  227 

228  93805TAI  l.AMASSFE,  FLA  228 

229  93 8 140 1 NO. INN AT  1 1  OHIO  229 

230  93815BAYT0N*  OHIO  230 

231  93817EVANSVILLE*  IND  231 

232  9381 9INBI ANAPOL IS »  INB  232 

233  93820LEX INGTON »  KY  233 

234  93821  LOUISVILLE »  KY  234 

235  93822SPRINGFIELD»  ILL  235 

236  93987LUFKIN*  TEX  236 

237  93997RIJSSELL,  KANS  237 

238  94008GLASG0Wf  MONT  238 

239  94012HAVRE*  MONT  239 

240  94014WII  LISTON*  N  DAK  240 

•241  94 224 AS TORI A  *  ORF.G  241 

242  942400UII. LAYUTE  r  WASH  242 

243  94702RRIDGEP0RT *  CONN  243 

244  94725MASSENA »  NY  244 

245  94789NEW  YORK r  NY  245 

246  9481 4HOUOHTON  LAKE  r  MIOH  246 

247  94822R00KF0RB  *  ILL  247 

248  94823PITTSBURGH  f  PA  248 

249  94830T0LED0,  OHIO  249 

250  9484 ACHIC AGO »  ILL  250 

251  94847BETR0IT  *  MIOH  251 

252  94849ALPENA  *  MICH  252 

253  94860DRANB  RAP IBS t  MIOH  253 

254  94908BUBUPUE »  IOWA  254 

255  94910WATERL00 *  IOWA  255 
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5.3  Approximate  Analysis  Based  on  Jorgensen  et  al.  (31) 

as  previously  stated,  Jorgensen  et  al.  (31)  give  the  first  two  items  of  the 
necessary  probabilistic  information  for  lUtt  stations  over  the  U.S.A. 

The  locations  ot  the  stations  are  illustrated  in  Figure  5.2.  The 
probabilistic  information  is  given  for  the  necessary  24-hour  period  and 
a  reasonable  discretization  of  rainfall  depths.  Lacking  were  the 
necessary  joint  probabilities,  items  3  througn  6  in  Section  5.1. 

Since  the  original  data  set  is  unavailable,  the  missing  probabilities 
are  not  attainable  except  through  approximations. 

For  example,  assuming  complete  independence  would  result  in: 


P[rain  in  and  rain  in  i2]  =  P[rain  in  i^j  PLrain  in  i2] 


P[no  rain  in  i^  and  no  rain  in 

=  P[no  rain  in  i^]  PLno  rain  in  i2] 


P[rain  in  ij  and  no  rain  in  i.,]  =  P[rain  in  ij]  P[no  rain  in  i2] 


1 


»  ' 


I  : 
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Figure  5 


ions  of  stations  in  Jorgensen 


's 


Pl.no  rain  in  ij  and  rain  in  i2  =  P[rain  in  i^J  P[no  rain  in  i^j 


The  above  are  fully  defined  by  the  Jorgensen,  et.al.,  (31;  results  and 
readily  obtained.  They  respresent  a  lower  limit  tor  the  joint 
probabilities  of  events  or  no  events.  This  is  a  minimum  positive 
correlation  case. 

A  maximum  positive  correlation  case  is  obtained  as  follows.  If 
is  the  number  of  rainy  events  in  i,,  N.  is  the  number  of  rainy 

i  i2 

events  in  i2,  and  M  the  number  of  days  in  the  record,  then 

MinLN,  ,  N.  ] 

1  2 

P[rain  in  i  (  and  rain  in  i2]  <  - - =  P1  (5.2) 

Min[M-N.  )  ,(M-N .  )J 
I  2 

P[no  rain  in  i,  and  no  rain  in  i?]  <  - jjj -  -  °2 

(5.3) 


For  N.  >  N. 

il  i2 

N  -  N. 

1 1  2 

P[rain  in  i^  and  no  rain  in  i2J  :  - ^ - =  P-j  (5.4) 

P[no  rain  in  i^  and  rain  in  i2]  ;  Pg 
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~9 

For  N.  >  N- 

'2  M 

-  s  a 

Prrain  in  i2  and  no  rain  in  i^j  ~  — — ^ 

P[rain  in  i^  and  no  rain  in  i ^3  -  P3 

If  this  approach  were  to  be  taken,  a  set  of  rules  would  have  to  be 
defined  in  order  to  decide  when  to  use  the  independence  or  the  maximum 
positive  correlation  case.  One  such  set  may  be: 

1)  Stations  east  of  the  Rockies  are  independent  of  stations  on  or 
west  of  the  Rockies. 

2)  Within  the  east  and  west  regions  independence  will  be  tested 
by  comparing  mean  storm  depths  over  the  24  hour  period. 

3)  Areas  of  doubt  will  be  decided  on  the  basis  of  qualitative 
mean  storm  track  knowledge  as  given  by  Klein,  (32). 

A  possible  statistical  test  is  the  t  test  for  two  independent 
samples  when  variances  are  assumed  equal.  The  hypothesis  is  that  the 
difference  of  the  true  means  is  zero.  The  test  is  based  on  the 
statistic 

(5.6) 

where 


X1  -  x2 


S  (—  +  —  )* 

n  '  n  _  n  / 


(5.5) 
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a  t  distributed  variable 


The  t  value  is  compared  with  t  „  ,  „ 

r  a.n-|  +  rig  -  2 


with  n1  +  n2  -  2  degrees  of  freedom  at  an  o  level  of  significance. 

In  our  case  n-j  and  n2  are  the  number  of  rainfall  days  in  a  season  at  stations  1  and 
2.  The  means,  x^  and  x2  are  the  average  accumulations  for  a  given  time  period. 

The  combined  variance  can  be  computed  as: 


Hx 

i=l 


n 


~  +  ^  (*2l~  x2)  ■  n2 

n^  +  n2  -  2 


n]  +  n2S2 
n^  +  n2  -  2 


(5.8) 


where  P[x  ]  is  the  probability  that  if  it  rains  the  amount  will  be  in 
the  tth  interval  of  the  7  defined  by  Jorgensen,  et  al  (31). 

Following  are  two  examples. 

For  the  spring  months  of  March,  April  and  May  the  average  24  hour 
accumulation  in  Kansas  City,  Mo.,  is  0.32  in.  The  corresponding  average 
in  Hartford,  Conn.,  is  0.32  in.  Are  the  two  means  statistically  the 
same? 

For  an  answer,  use  Equation  5.8,  with  x^  as  the  mid  value  of  interval  %  as 
defined  by  Jorgensen  et  al.  (31), 


For  Kansas  City,  =  464 

S2  =  (.045  -  -32)2(1 .00  -  .62)  +  (.175  -  .32)2(.62  -  .40) 

+  (.375  -  . 32 )2 ( .4  -  .22)  +  (.75  -  .32)2(.22  -  .07) 

+  (1.25  -  . 32 )2( . 07  -  .03)  +  (1.75  -  -32)2( .03  -  .01) 

+  (3  -  .32)2  .01  =  .21 

For  Hartford,  n2  =  508 

S2  =  (.045  -  .31 )2(1  -  .64)  +  (.175  -  .31 )2(-64  -  .43) 

+  (.375  -  .31 )2( .43  -  .21)  +  (.75  -  .31 )2( .21  -  .06) 

+  (1.25  -  .31 )2( .06  -  .02)  +  (1.75  -  .31 )2(.02  -  .01) 

+  (3  -  .31 )2( .01 )  =  .19 


Therefore , 


i 


The  t  statistic  is 


t  = 


.32 


.31 
‘  508 


IT 


•41<464 


.38 


The  t  distribution  at  a  =  .01  for  a  1  sided  test  is  2.326.  The 
comparison  indicates  that  our  value  is  well  inside  that  upper  value 
indicating  that  we  cannot  reject  the  hypothesis  that  the  means  are  the 
same. 

For  Louisville,  Ky.,  n^  =  534 
SZ  =  0.23 

when  compared  to  Hartford 

_  534 ( .23)  +  508 (.19)  _ 

So - 534  +  508  -7  ’ 


and 


t  =  _ .36  -  .31 

+ 


=  1.75 


which  indicates  that  the  hypothesis  of  equal  means  is  accepted  at  the 
0.01  level  but  is  rejected  at  the  0.05  level.  This  would  be  an  unclear 
case,  but  it  indicates  relative  power  of  the  test. 


Ill 


The  most  obvious  approach  to  estimate  the  necessary  joint 


probabilities  is  to  analyze  raw  data.  With  the  TDL  data  set,  the  record 
can  be  scanned  to  count  the  number  of  rainy  days  in  any  one  station;  the 
number  of  days  with  rain  in  a  given  depth  interval  &  in  station  i;  the 
number  of  coincident  wet  days  in  any  pair  of  stations;  dry  days  in  any 
pair  of  stations;  and  wet-dry  in  any  pair  of  stations.  Define  these  as 

1  o  o 

N • ,  N,  ,  n!  ■  ,  Nf  ■  ,  fr  ■  respectively.  Then  probability  estimates  are 
i  u  V2  1 1  ^ 


P  L fa i n  in  i^  and  rain  in  i^J 


NU 


M 


P  L rain  in  i^  and  no  rain  in  i.]  = 


Ni  i 

V2 


M 


P  [rain  in  i^  and  no  rain  in  i^j  = 


Ni  i 
1 1  '2 


M 


(5.9) 


(5.10) 


(5.11) 


P  Lno  rain  in  i^  and  rain  i^j  =  1  - 


(n!  .  +  N?  •  +N?  .  ) 

V2  1 1 1 2  V2 


(5.12) 


P[rain  in  ij  -  rj- 


(5.13) 


P[no  rain  in  i]  =  1- 


(5.14) 


Ni 


M 


P[rainfall  in  depth  interval  i  and  rainfall  in  i i  =  -pp 


(5.15) 
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CHAPTER  6 


Model  Application  and  Case  Study 


6.0  Introduction 

Appendix  A  provides  an  illustrative  example  of  how  the  data 
analysis  program  works.  This  Chapter  intends  to  illustrate  the 
procedure  with  a  somewhat  more  realistic  case  study.  In  doing  so,  the 
following  objectives  will  be  satisfied: 

1.  Discuss  the  preliminary  steps  necessary  for  network  design. 
These  include: 

a.  computer  coding  the  Corps  of  Engineers  Districts  map 
and  precipitation  stations  locations. 

b.  obtaining  Thiessen  areas  for  each  station  in  each 
C.  E.  district  in  the  nation. 

c.  processing  the  precipitation  data  from  the  Techniques 
Development  Laboratory. 

2.  Illustrate  how  to  obtain  transmission  rates  and  relate  them  to 
rainfall  totals. 

3.  Show  how  to  evaluate  hypothetical  networks  for  various  Corps  of 
Engineers  districts  using  a  single  satellite  channel  and 
adaptive  random  reporting. 


6.1  Data  Pre-Processing  and  Analysis 

The  purpose  of  this  section  is  to  review  the  key  steps  in  the 
geographical  and  climatological  data  analysis  required  to  generate  the 
files  for  both  prototype  and  production  model  application  of  DCPMAIN. 
Details  of  the  character  of  the  files  needed  for  input  to  DCPMAIN  are 
included  in  Appendix  A.  Details  of  the  analysis  models  used  to  produce 
these  files  are  presented  in  Appendix  B. 

Two  distinct  data  sets  and  types  of  information  are  required  by 
DCPMAIN.  First  is  the  geographical  information  --  the  Thiessen  areas 
and  associated  rainfall  stations  for  candidate  users.  Because  this 
project  focuses  on  design  of  random  reporting  data  collection  networks 
for  C  of  E  users,  the  existing  Corps  district  structure  was  used  as  the 
basis  for  identifying  candidate  users.  There  are  currently  36  C  of  E 
districts,  as  shown  in  Figure  6.1,  and  each  district  was  assigned  a 
user  number.  User  numbers  are  also  displayed  in  Figure  6.1. 


The  computation  of  Thiessen  areas  and  associated  rainfall  stations 
required: 

1.  a  digitized  map  to  display  the  relative  location  of  user 
boundaries;  and 

2.  the  location  of  rainfall  data  collection  stations. 

In  order  to  construct  the  digitized  district  map,  a  224  by  136  cell  grid 
overlay  of  the  "December  1970  Division  and  District  Boundaries"  map  of 
Corps  districts  was  developed.  Each  point  on  the  grid  was  assigned  to  a 
Corps  district,  or  a  "null"  district  outside  of  the  area  ot  interest. 
This  assignment  was  accomplished  using  program  CAPPER  which  takes  as 
input  data  on  district  boundary  coordinates.  The  resultant  digital  map 
was  verified  and  stored  in  File  MAP.  A  display  of  the  output  map  is 
shown  as  Figure  6.2.  The  information  on  rain  gauge  station  locations 
was  developed  from  NWS  publications  and  TDt  data  sheets.  The  longitude 
and  latitude  locations  were  transformed  to  grid  locations  fo»  input  to 
Program  MAPPER,  the  program  for  computation  of  Thiessen  area.;.  For  each 
of  the  36  user  areas,  the  rainfall  stations  influencing  the  area,  and 
the  fraction  of  total  district  area  which  each  station  influenced,  were 
stored  in  FIL1,  the  geographical  information  file,  generated  by  program 
MAPPER. 
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Figure  6.2:  Digitized  District  Map,  horizontal 
and  vertical  scales  are  distorted 


Vue  second  key  type  of  information  required  is  climatological  data 


--  specifically  information  on  rainfall  throughout  the  conterminous 
United  States.  As  noted  in  Section  5.2.1,  the  NWS  TDL  data  set  of 
rainfall  data  at  6-hour  intervals  for  255  stations,  was  selected  as  the 
most  suitable  source  for  climatological  data  analysis.  This  data  for  9 
years  from  October  1,  1972,  through  September  30,  1981,  was  obtained  on 
magnetic  tape. 

The  data  were  analyzed  using  Program  ZAPPER  to  evaluate  numbers  of 
events,  and  TAPPER  to  convert  these  numbers  to  relative  frequencies  of 
occurrence  of  events.  The  analysis  was  conducted  for  4  seasons  as 
follows: 

Season  Months 

1  December,  January,  February 

2  March,  April,  May 

3  June,  July,  August 

4  September,  October,  November 

For  each  season,  two  types  of  information  were  generated: 

1.  A  station  file  containing  information  on  probabilities  of 
rainfall  for  various  depth  classes  for  each  of  the  255  rainfall 
stations;  and 

2.  A  cross  or  paired  station  file  containing  information  on 
probabilities  of  joint  occurrence  of  rainfall  between  each  of 
the  possible  pairs  of  rainfall  stations. 
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These  data  provide  the  climatological  input  to  the  analysis  of  random 
reporting  data  collection  platform  networks. 


The  type  of  data  needed  have  been  described  extensively  in  Chapter 
5.  For  each  station,  the  probabilities  of  events  in  the  following  depth 
intervals  and  under  the  following  conditions  are  obtained: 


a. 

0.00  - 

0.01 

inches 

b. 

0.01  - 

0.50 

inches. 

given 

rain 

c. 

0.50  - 

1.00 

inches , 

given 

rain 

d. 

1.00  - 

1.25 

inches , 

given 

rain 

e. 

1.25  - 

1.50 

inches , 

given 

rain 

f. 

1.50  - 

2.00 

inches, 

given 

rain 

g- 

2.00  - 

3.01 

inches. 

given 

rain 

h. 

3.00+ 

inches 

,  given 

rain. 

In  addition  for  each  pair  of  stations  the  following  probabilities  are 
obtained: 

a.  no  rainfall  at  i  and  no  rainfall  at  j 

b.  no  rainfall  at  i  and  rainfall  at  j 

c.  rainfall  at  i  and  no  rainfall  at  j 

d.  rainfall  at  i  and  rainfall  at  j. 

The  station  data  are  stored  in  files  F2S1 ,  F2S2,  F2S3,  and  F2S4.  The 
cross  station  or  paired  station  data  are  stored  in  files  F3S1 ,  F3S2, 

F3S3 ,  and  F3S4. 

This  completes  the  overview  of  the  geographical  and  climatological 
data  processing  and  analysis.  The  sections  that  follow  present  first  a 
discussion  of  determination  of  reporting  rates,  and  then  a  review  of  the 
prototype  model  application. 


6.2  Determining  Reporting  Rates 


Recall  from  Chapter  2  that  most  DCP's  set  their  message  transmision 
rate  according  to  an  algorithm  of  the  form: 

RATE  =  MAX [BASE  RATE,  (A*CHANGE  IN  PARAMETER)] 

For  example,  the  New  England  Division  has  suggested  the  following 

parameters  for  a  typical  station  (Jewett  City,  CT)  in  their  area: 

For  flows  less  than  8080  cfs,  the  base  rate  is  1  message  every  12 

hours  (2.31  x  10~^  mess/sec).  For  flows  between  8080  and  14450  cfs  the 

base  rate  is  1  message  every  2  hours  (1.39  x  10~^  mess/sec).  For  flows 

-4 

above  14450  cfs  the  base  rate  is  1  mesage  every  30  minutes  (5.55  x  10 
mess/sec).  Parameter  A  is  3.33  x  10  and  a  sample  is  taken  every  1800 
seconds  (0.5  hour)  to  check  for  parameter  changes.  In  summary  the 
reporting  algorithm  is: 

2.31  x  10“5 

or  -4 
1.39  x  10  \ 

or  -4 
5.55  x  10  * 

where  Ax  is  the  change  in  stage  in  the  last  half  hour,  in  feet.  Figure 
6.3  shows  how  the  base  rates  plot  versus  discharge.  The  resulting  step 
function  is  shown  in  dashes. 

To  get  an  idea  of  what  would  be  the  maximum  transmission  rate  at 
Jewett  City,  we  studied  the  largest  flood  on  record  [.42J.  In  August  20, 
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Precip.  depth,  inches 


1 ^bt>  the  peak  discharge  was  recorded  at  40,700  cfs.  Previous  to  that 
event  the  largest  flood  had  been  29,200  cfs.  During  the  August  20, 

1955,  flood,  the  largest  observed  change  of  state  in  one  hour  was  1.1 
feet  which  approximately  gives  a  0.5  ft.  change  in  half  an  hour. 

According  to  tjie  reporting  rate  algorithm  this  would  have  led  to  a 

O 

message  about  every  15  minutes  (0.11  x  10  mess/sec).  This  rate  is 
plotted  with  dots  in  Figure  6.3,  starting  at  30,000  cfs  (the  second 
largest  flood  in  the  record). 

The  eight  reporting  rates  needed  in  our  analysis  (Chapter  4) 
correspond  to  a  peak  reporting  rate  achieved  during  a  rain-flood  event. 
Since  all  stations  within  a  user  utilize  the  same  set  of  reporting 
rates,  it  is  assumed  that  although  different  basins  and  streamflow 
stations  within  a  user  will  achieve  different  peaks,  the  user  would  want 
similar  time  resolution  at  the  peak  for  all  sites. 

It  is  then  reasonable  to  state  that  the  eight  reporting  rates  we 
are  looking  for  should  lie  between  the  lowest  dash  line  and  the  dotted 
line  in  Figure  6.3.  The  rates  and  discharge  break  points  are  determined 
by  arbitrarily  (but  reasonably)  dividing  the  ordinate  and  abscissa 
between  dash  and  dotted  lines  in  Figure  6.3,  resulting  in  the  step 
function  shown  in  solid  lines.  Notice  that  more  sampling  resolution  was 
added  in  the  lower  discharge  ranges,  which  are  the  most  common  and  were 
somewhat  sparse  in  reporting  frequency.  The  abscissa  in  the  Figure  now 
represents  peak  discharge.  It  must  be  translated  to  rainfall 
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accumulation  in  24  hours,  according  to  the  limitations  of  our  analysis. 
In  following  the  above  procedure  it  is  implicitly  assumed  that:  a) 
large  rates  of  change  in  stage  are  associated  with  large  floods,  and  b) 
that  all  DCPs  within  the  influence  of  a  precipitation  station  peak 
simul taneously.  The  first  assumption  is  reasonable  and  the  second  will 
lead  to  conservative  results  in  terms  of  number  of  DCP's  in  a  channel. 

There  are  many  existing  methods  to  translate  peak  discharge  to 
total  rainfall  accumulation.  A  possibility  is  to  use  the  techniques 
suggested  by  the  Soil  Conservation  Service  (43), 

484  A  Pr 

QP  =  — f - -  (6-1) 

P 

where  l)p  is  peak  discharge  in  cfs;  A  is  basin  area  in  square  miles;  Pp 
is  effective  total  precipitation  (runoff)  in  inches,  and  tp  is  time  to 
peak  discharge  from  the  beginning  of  rainfall.  This  time  can  be 
expressed  as  (43) 

tr 

tp  =  ■£—  +  t^  (6.2) 

where  tp  is  the  storm  duration  (hours)  and  t  is  a  "lag"  time,  related 
to  the  invariant  time  of  concentration,  defined  as  the  time  in  hours 
from  storm  centroid  to  peak  discharge.  The  effective  precipitation  or 


runoff  (PE)  can  be  reasonably  obtained  for  any  basin  using  the  Soil 
Conservation  Service  "curve  number"  concept  or  using  traditional 

i 

antecedant  precipitation  index  coaxial  solutions.  The  lag  time  (or  the 
time  of  concentration)  is  again  generally  available  or  reasonably 
estimated. 

For  the  sake  of  this  example  we  "calibrated"  Equation  6.1  to  the 
August  20,  1955,  flood.  From  the  literature  (42)  t  was  estimated  as 
72  hours;  total  gross  rainfall  over  the  period  as  10.5  inches;  and  a 
runoff  coefficient  as  0.6.  This  leads  to  P^  =  6.3  inches  (10.5  x  0.6). 
The  tributary  area  to  Jewett  City  station  is  711  mi.  Qp  was  seen  to  be 
40700  cfs.  Solving  Equation  6.1  results  in  time  to  peak  (tp)  of  53 
hours.  Using  this  value  in  6.2  yields  a  lag  time,  1 1  ,  of  17  hours. 

Having  1 l  it  is  now  possible  to  associate  each  Qp  in  Figure  6.1 
with  a  gross  24-hour  precipitation  accumulation  (still  assuming  a  runoff 
coefficient  of  0.6).  The  Equation  is  simply 


QP 

7119.8 


(6.3) 


The  results  are  indicated  in  the  second  abscissa  in  Figure  6.3. 


From  them,  reasonable  breaking  points  (or  rainfall  intervals  can  be 
defined  for  our  analysis.  They  are  shown  in  the  third  abscissa  in  the 
Figure.  In  summary,  the  reporting  rate  algorithm  for  Jewett  City  is: 


]_ 

b^lmess/sec) 

rainfall  interval 

1 

■  J 

2.315  x  10'5 

0  -  0.01 

2 

7.0  x  10"5 

0.01  -  0.5 

3 

10.5  x  10'5 

0.5  -  1.0 

4 

13.89  x  10“5 

1.0  -  1.25 

5 

34.0  x  10"5 

1.25  -  1.5 

6 

55.5  x  105 

1.5  -  2.0 

7 

75.0  x  10'5 

2.0  -  3.0 

8 

111.0  x  10"5 

>  3.0 

(inches) 


The  above  reporting  rates  will  be  assumed  for  the  New  England 
Division  Corps  of  Engineers  in  our  illustrative  example.  The  rainfall 
intervals  correspond  to  the  ones  used  in  our  analysis  of  the  Techniques 
Development  Laboratory  data. 

6.3  Case  Study  Results 

In  practice  each  district  should  follow  an  exercise  similar  to  the 
one  outlined  above  for  the  New  England  Division.  For  the  purposes  of 
our  hypothetical  case  study,  the  reporting  rates  schedule  for  each 
district  studied  will  be  the  same,  equal  to  that  computed  for  the  New 
England  Division.  It  will  also  be  assumed  that  each  district  will  have 
80  DCPs  and  message  duration  of  3  seconds.  The  unit  load  limitation 


will  be  set  at  5  seconds  per  station.  The  season  studied  will  be  the 
months  of  June,  July  and  August,  season  3,  and  3  repetitions  of  a 
message  are  allowed. 

In  the  first  experiment.  New  York  (28),  Buffalo  (27), 

Baltimore  (30)  and  Pittsburgh  (26)  districts  are  assumed  operating  in  one 
channel.  In  a  second  experiment  we  consider  New  York  (28), 

Cincinnati  (17),  Tulsa  (12)  and  Sacramento  (8).  The  numbers  in 
parenthesis  correspond  to  the  identification  numbers  in  the  computer 
files  containing  our  digitized  nap.  Table  6.1  gives  the  input  data 
sequence  for  both  runs. 

Given  the  experiments  design,  any  differences  in  results  would  be  solely 
due  to  differences  in  precipitation  statistics  of  the  various  group  of  users. 

It  would  be  expected,  and  can  be  confirmed  from  statistics  file  F3S3  that  Case  1 
users  would  show  much  higher  correlations  of  events  than  Case  2  users.  This 
should  lead  to  differences  in  reliability  levels. 

Case  1  results  indicate  that  individual  users  have  little  problems  satis¬ 
fying  unit  loads  or  reliability  criteria.  Also  notice  the  similarity  of  the 
basic  statistics  of  the  users,  means  and  variances  of  reporting  rates.  The 
reliability  of  successful  message  transmission  for  all  users  in  Case  1 
together  is  0.8944.  Remember,  this  implies  that  about  11  percent  of  the  time 
less  than  95%  successful  transmissions  are  achieved. 
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TABLE  6.1 

Input  Data  Sequence  for  Hypothetical  Case  Studies 
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Case  2  results  show  much  larger  differences  in  individual  stations 
statistics.  Nevertheless,  individual  users  again  have  little  trouble  satisfying 
unit  load  and  reliability  criteria.  In  contrast  to  Case  1  the  group  performance 
is  much  better  showing  a  0.9773  probability  of  meeting  the  95%  reliability 
criteria.  This  implies  that  failure  to  satisfy  this  level  of  reliability  occurs 
only  3%  of  the  time  versus  11%  in  Case  1. 

It  seems  that  climatic  effects  can  be  fairly  important  in  network  design. 
The  user  of  this  system  can  experiment  with  various  configurations  of  users 
looking  to  maximize  performance  and  number  of  users  per  channel. 


Cases  1  and  2  Results 
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Appendix  A 


DCPMAIN  USERS  MANUAL 


A. 1  Introduction 

DCPMAIN  is  a  statistical  algorithm  written  in  standard  Fortran  IV, 
for  the  analysis  of  adaptive  random  reporting  data  collection  networks. 
Details  of  the  algorithm  are  given  in  Chapter  4  of  this  report.  This 
software  is  intended  to  serve  as  a  screening  tool  for  the  design  and 
evaluation  of  DCP  networks  adaptively  responding  to  spatially  and  time 
varying  climate.  The  subsections  which  follow  will  provide  users  of 
DCPMAIN  with  the  information  necessary  for  its  use. 

Several  modeling  assumptions  were  made  during  program  development 
and  are  summarized  below: 

1.  A  "user"  is  defined  as  a  Corps  of  Engineers  District  or  any 
other  non-overlapping  geographical  unit. 

2.  All  users  have  the  same  message  length  and  will  strive  for 
message  reliability  using  repetition  Method  2  as  described  in 
the  “Users  Guide  for  Random  Reporting  (6)".  The  number  of 
repetitions  is  taken  to  be  at  least  3,  the  same  for  all  users. 
The  number  of  repetitions  is  a  user  controlled  parameter. 

3.  All  stations  within  a  user  have  the  same  adaptive  reporting 
rate  algorithm.  This  implies  the  same  possible  reporting 
rates.  Different  users  can  have  different  parameters. 


4.  Trading  of  unit  loads  between  users  will  not  be  allowed. 

5.  Each  user  will  attempt  to  use  up  to  their  available  unit 
loads.  A  unit  load  is  presently  defined  as  120  sec.  per 
day  per  station  or  5  sec.  per  hour  per  station,  whatever 

is  limiting  over  the  daily  or  hourly  intervals.  Nevertheless, 
this  is  a  user  controlled  parameter. 

A . 2  Model  Description 

DCPMAIN  evaluates  performance  of  adaptive  random  reporting  data 
collection  platforms  operating  on  a  single  GOES  channel  at  both  local 
and  national  (or  regional)  levels.  The  model  first  evaluates  DCP 
performance  at  the  local  or  user  level.  Following  this,  calculations 
are  made  to  assess  network  performance  on  a  nation-wide  or  channel -wide 
basis.  The  general  structure  of  DCPMAIN  is  illustrated  in  the  logic 
flow  chart  of  Figure  A.l. 

Due  to  the  seasonal  variations  in  the  climatology  of  the  continental 
U.S.,  DCPMAIN  has  been  designed  to  operate  on  a  four  season  basis.  The 
main  input  is  probabilistic  rainfall  information  over  24  hour  periods 
within  seven  depth  intervals.  The  intervals  are  user  defined  in 
program  ZAPPER.  Data  analysis  was  performed  in  this  project  using 
the  following  intervals:  0.0-0. 0: , 0.01 -0.5,  0. 5-1.0,  1.0-1.25,  1.25-1.5 

1. 5-2.0,  2.0-3. 0  and  greater  than  3.0  inches. 

A  brief  description  of  individual  subroutines  fol  lows. 
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Figure  A.l 

DCPMAIN  LOGIC  STRUCTURE 


LOCAL 

Performs  local  design 
calculations  for  user  J 


NATION 

Performs  national 
I  design  calculations 


RCOVAR 

Computes  covariance 
terms  between  users 
for  NATION 


DCPMAIN 


DCPMAIN  is  the  main  or  controlling  routine  for  the  DCPMAIN 
program.  It  controls  the  logical  flow  of  data  through  the  model,  from 
entry  of  input  data  to  the  printing  of  final  results.  All  required  data 
files  are  initialized  and  opened  as  specified  in  the  input  data  stream. 
Evaluation  of  the  probability  of  user  compliance  of  unit  load  limitations 
and  of  the  probability  of  achieving  95%  (or  larger)  message  reception 
reliability  at  the  local  and  national  levels  is  done  in  DCPMAIN  following 
the  concepts  presented  in  Chapter  4. 

SUBROUTINE  INIT 

Subroutine  INIT  initializes  all  major  arrays  and  variables  re¬ 
quired  during  program  execution.  In  addition,  program  default  values 
are  set.  Defaulted  are  units  (or  channels)  numbers  for  input/output 
operations  and  variable  IC0N1  which  is  set  equal  to  the  number  of 
stations  in  the  meteorological  records  analyzed  by  program  ZAPPER 
(see  Appendix  B).  It  is  presently  defaulted  at  255. 

SUBROUTINE  DATAIN 

Subroutine  DATAIN  reads  and  processes  the  input  data  deck  for 
simulation  and  echo  prints  the  input  stream. 
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SUBROUTINE  SEARCH 

Subroutine  SEARCH  reads  and  locates  the  starting  position  of  data 
stored  in  the  user  information  file  (see  next  section)  for  a  specific 
user.  Users  must  be  uniquely  identified  by  numbers. 


SUBROUTINE  ALLOC 

Subroutine  ALLOC  allocates  DCP's  in  a  given  user  to  precipitation 
stations  within  the  user.  The  number  of  stations  within  a  user  are  as¬ 
sumed  uniformly  distributed  and  assigned  to  precipitation  stations  in 
proportion  to  Thiessen  areas  of  a  station  within  a  user.  The  pro¬ 
cedure  is  described  in  Section  5.1  (Equation  5.1)  of  the  main  report. 

In  addition,  a  check  is  performed  to  ensure  that  the  number  of  stations 
and  associated  areas  have  been  properly  recorded  in  the  user  information 
file. 

SUBROUTINE  LOCAL 

Subroutine  LOCAL  performs  the  bulk  of  the  calculations  required 
to  evaluate  local  or  user  design  criteria. 

The  subroutine  computes: 

a)  The  mean  reporting  rate  at  a  station,  x..,  as  described 
by  Equation  4.7. 


b)  The  variance  of  the  reporting  rate,  var[A . .],  as  described 

*  J 

by  Equation  4.9. 


c)  The  covariance  between  all  stations  within  a  user. 

Equation  4.10. 

d)  The  mean  station  reporting  rate  for  a  user,  x  as  in 
Equation  4.6. 

e)  The  variance  of  the  mean  station  reporting  rate,  vartx^] , 
as  in  Equation  4.8. 

The  program  assumes  that  all  DCPs  assigned  by  ALLOC  to  a  given 
precipitation  station  have  the  statistical  properties  of  that  station. 
Therefore,  the  code  avoids  summing  over  all  DCP  as  in  Equations  4.6  and 
4.8,  and  exploits  the  fact  that  many  of  the  terms  in  those  equations 
are  operationally  the  same. 

All  covariance  terms,  also  those  calculated  for  the  national 
design  (subroutine  NATION)  correspond  to  Equation  4.10.  Nevertheless, 
the  program  computes  them  using  an  equivalent  but  more  efficient 
mathematical  formulation.  Term  I  in  Equation  4.10  remains  the  same 
in  the  program.  Terms  2  through  4  are  computed  using: 


Term  2: 


8 


i 


b 


x.  . )P[rain  in  i9  and  not  in  i 

1  1 J  C 
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1 


1. 


[ 


Term  3 
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SUBROUTINE  NATION 

Subroutine  NATION  completes  calculations  at  the  national  level, 
utilizing  the  results  obtained  from  subroutine  LOCAL  and  station  co- 
variance  terms  between  users  computed  in  Subroutine  RCOVAR.  Equations 
4.14  through  4.16  are  used.  Final  output  from  subroutine  NATION  in¬ 
cludes  the  mean  national  DCP  reporting  rate  and  its  variance. 
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SUBROUTINE  RCOVAR 

Subroutine  RCOVAR  calculates  covariance  terms  between  stations  in 
different  users.  Its  results  are  used  by  subroutine  NATION.  The 
algorithm  used  to  compute  covariances  is  identical  to  that  of  sub¬ 
routine  LOCAL. 

A. 3  Program  Data  Requirements 

In  order  to  run  DCPMAIN,  three  data  base  files  are  required,  as  well 
as  an  input  data  deck  (in  disk  file).  The  contents  of  each  of  these 
files  and  file  formats  will  be  presented  in  the  sections  to  follow. 

A. 3.1  User  Information  File 

The  user  information  file,  FIL1 ,  is  a  random  access  file  generated 
by  program  MAPPER  (see  Appendix  b).  Each  user  is  allocated  51  records 
for  the  storage  of  a  user  identification  number  in  the  first  record  fol¬ 
lowed  by  25  records  containing  paired  data  entries  composed  of  a 
precipitation  station  identification  number  and  the  percent  areal 
coverage  of  that  station  within  the  user. 

This  file  begins  in  record  zero  and  may  contain  data  for  up  to 
1260  individual  users.  Nevertheless,  the  analysis  is  presently 
limited  to  36  users.  The  user  identification  number  is  stored  in 
fixed  110  format  and,  therefore,  must  be  an  integer  value  ending  in 
column  10.  Columns  21  through  30  should  contain  0.0. 

Following  the  user  identification  number  record,  25  records  are 


reserved  for  storage  of  precipitation  station  identification  numbersand 
fractional  areal  coverage  of  that  station  within  a  given  user.  These 


values  are  stored  in  fixed  format  beginning  with  the  precipitation 
station  identification  number,  ending  in  column  10,  and  the  fractional 
areal  coverage  of  that  station  located  in  columns  21  through  30. 

The  fractional  areal  coverage  values  should  contain  a  decimal  point  and, 
when  summed  for  a  given  user  should  equal  1.00.  Users  with  fewer  than 
25  paired  station  values  must  contain  records  with  zero  values  for 
both  the  station  identification  number  and  percent  areal  coverage  up 
to  the  25th  record. 

A  sample  user  information  file  for  a  3  user  case  (with  a  6  station  limita¬ 
tion)  and  only  26  records  per  user  is  illustrated  in  Table  A.l. 

A. 3. 2  Precipitation  Station  Probability  Files 

The  precipitation  station  probability  files  are  random  access  files 
generated  by  program  ZAPPER  (Appendix  B).  Four  files  should  be  avail¬ 
able  for  use  containing  seasonal  information.  These  files  are  named 
F2S1 ,  F2S2,  F2S3,  and,  F2S4  representing  "file  2,  season  1"  through 
"file  2,  season  4."  Since  runs  are  made  on  a  seasonal  basis,  only 
one  of  the  above  files  is  required  for  an  individual  simulation.  This 
file  must  correspond  to  the  season  of  interest. 

Each  record  contains  a  precipitation  station  identification 
number,  a  season  flag  value,  the  probability  of  no  precipitation  for  that 
station,  and,  seven  probabilities  of  precipitation  within  the  given 
depth  intervals  (given  that  it  rains).  The  records  are  arranged  in 
sequential  order  by  precipitation  station  identification  number, 
beginning  with  record  one  and  ending  with  record  255  (or  the  number  of 
precipitation  stations  analyzed  by  program  ZAPPER). 


Sample  User  Information  File,  FIL1 
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Data  values  are  stored  in  fixed  format  with  an  integer  station 
identification  number  ending  in  column  10,  an  integer  season  flag 
value  (i.e.,  1,  2,  3,  or  4)  located  in  column  20,  and,  the  eight 
probability  values  located  in  columns  21  through  100.  Probability  values 
are  stored  as  real  values  to  three  significant  digits  in  fields  of  10 
columns  each.  As  a  check,  the  sum  of  the  seven  depth  interval  proba¬ 
bilities  should  equal  1.0. 

A  sample  precipitation  station  probability  file  for  season  1  is 
shown  in  Table  A. 2,  again  for  a  3  user,  six  climatological  stations  case. 

A. 3. 3  Conditional  Probability  Paired  Station  File 

The  conditional  probability  paired  station  files  are  random  access 
files  generated  by  program  ZAPPER.  As  with  the  precipitation  station 
probability  files,  four  files  should  be  available,  by  season.  These 
files  are  called  F3S1 ,  F3S2,  F3S3  and,  F3S4  corresponding  to  "file  3, 
season  1,"  etc.  Here  again,  only  one  of  the  above  files  is  required 
for  an  individual  run  for  the  season  of  interest. 

Each  record  of  the  conditional  probability  paired  station  file 
contains  two  integer  station  identification  numbers  followed  by  an 
integer  season  flag  value  and  each  of  the  following  probabilities: 

P[no  rain  in  station  i-j,  and  no  rain  in  station  ig]  1  P[no  rain  in  station 
i  1 ,  and  rain  in  station  i2];  P[  rain  in  station  i-|,  and  no  rain  in 
station  i2];  P[rain  in  station  i-j,  and  rain  in  station  i^]. 

Data  values  are  stored  in  fixed  format  beginning  in  record 
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10  and  20  for  stations  and  ig.  respectively.  The  season  flag 
value  (i.e.,  1,  2,  3,  or  4)  is  located  in  column  30  and  joint  station 
probabilities  are  located  in  columns  31  through  .70.  Each  probability 
value  is  stored  as  a  real  value  to  three  significant  figures  in  fields 
of  10  columns  each.  The  paired  station  values  are  ordered  as  shown 
in  Table  A. 3,  again  for  a  limited  example  of  3  user,  6  stations. 
Probabilities  in  each  record  should  sum  to  1.0. 


A. 3.4  Data  Input  Deck 

The  data  input  deck  for  DCPMAIN,  file  FIL0,  is  a  sequential  file 
read  by  subroutine  DATAIN.  All  data  values  are  in  entered  in  fixed 
format.  Each  of  the  data  cards  required  is  described  below. 

Card  1 :  The  first  card  of  the  data  input  deck  contains  the  number  of 

users  for  the  simulation.  The  current  configuration  of  the  pro¬ 
gram  allows  for  a  maximum  of  36  users  per  simulation.  This 
card  is  read  in  110  format,  and,  therefore,  should  be  right 
justified  ending  in  column  10. 

Cards  2  and  3:  The  next  set  of  cards  contains  the  required  input 
information  for  user  one,  the  first  of  which,  card  2, 
contains  two  integer  values.  The  first  value  is  the  user 
identification  number  for  user  one.  This  must  correspond  to 
a  user  identification  number  in  the  user  information  file,  or 
execution  will  be  terminated.  The  second  value  on  card  2  is 
the  number  of  DCP's  assigned  to  user  one.  This  should  be 
right  justified  and  end  in  column  20. 
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Card  3  contains  the  eight  base  reporting  rates  for  user 
one  (see  Chapter  4).  These  are  entered  in  units  of  trans¬ 
missions  per  second  beginning  with  the  transmission  rate 
during  dry  periods  and  ending  with  the  maximum  transmission 
rate  for  rainfall  accumulations  greater  than  3.0  inches. 

All  values  are  entered  in  fields  of  10  columns  leaving  the 
first  column  blank,  i.e.,  8(1X,E9.7)  format.  Therefore, 
data  entries  should  be  right-justified  and  end  in  columns 
10,  20,  30  ...  and,  80. 

If  multiple  users  are  to  be  simulated.  Cards  2  and  3  are 
simply  repeated  with  appropriate  values  for  each  user.  The 
total  number  of  user  groups  (i.e..  Card  2  and  Card  3  com¬ 
binations)  must,  however,  correspond  to  the  number  of  users 
specified  on  Card  1  of  the  input  data  deck. 

Card  4:  Card  4  contains  the  season  flag  value  for  the  simulation 
run.  This  is  read  in  fixed  format  and  must  be  located  in 
column  10.  Feasible  values  range  from  1  to  4  and  correspond 
to  season  1,  season  2,  season  3  and  season  4  of  the  precipi¬ 
tation  station  probability  files  and  conditional  probability 
paired  station  files,  respectively. 

Card  5:  Card  5  of  the  input  stream  contains  the  maximum  transmission 
duration,  in  seconds  per  hour,  specifying  a  unit  load  per  0CP. 
Currently,  NESS  has  defined  a  unit  load  as  five  seconds  per 
hour  per  DCP.  This  value  is  entered  in  columns  1  through  10 
and  must  contain  a  decimal  point.  This  is  assumed  constant 
for  all  users. 
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Card  6:  Card  6  contains  the  number  of  transmission  repetitions  for 
each  DCP.  This  is  an  integer  value  read  in  110  format,  and, 


therefore,  must  be  right-justified  ending  in  column  10. 

Generally  three  repetitions  correspond  to  a  desired  .95 
probability  of  successful  transmission.  Five  repetitions 
correspond  to  .99  probability  of  successful  transmission. 

Card  7:  The  final  card  in  the  input  stream.  Card  7,  contains  the 
message  transmission  duration  in  seconds.  This  is  a  real 
value  entered  in  columns  1  through  10  and  must  contain  a 
decimal  point.  Transmission  durations  are  also  assumed  con¬ 
stant  for  all  DCP's. 

A  sample  input  data  stream  for  three  users  is  listed  in 
Table  A. 4.  For  user  100,  simulation  will  be  performed  for 
a  total  of  15  platforms.  Users  200  and  500  contain  10  and 
5  platforms  each.  The  season  selected  is  season  1.  A 
unit  load  has  been  defined  as  5  transmissions  per  hour  per 
user  with  a  transmission  duration  of  2.0  seconds  and  3 
repetitions  per  message. 

A. 4  Program  Output 

Results  from  the  program  are  best  illustrated  with  a  small  example. 
Tables  A.l  through  A. 4  correspond  to  input  data  of  a  hypothetical  case 
study  where  the  performance  of  3  users  in  a  single  channel  is  being 
evaluated.  Figure  A. 2  is  a  schematic  diagram  of  the  3  users  configuration, 
with  available  climatological  stations,  shown  by  a  triangle  and  an 
approximately  uniformly  distributed  0CP  shown  by  circles. 
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The  output  follows  in  Table  A. 5.  First  the  user  information  input 
is  echo-printed.  It  indicates  the  number  of  users,  their  identification 
numbers,  the  number  of  DCP's  per  user  and  the  8  reporting  rates  of 
each  user.  Also  given  is  the  unit  loading  factor  in  seconds  per  hour; 
the  message  repetition  rate;  and  the  message  duration  in  seconds. 

Results  of  the  local  design  follow.  For  each  user,  the  mean 
reporting  rate  per  station  and  the  variance  of  the  reporting  rate  are 
given.  The  mean  has  units  of  messages  per  second.  The  variance  has 
the  same  units  squared.  The  differences  in  means  and  variances  are 
due  to  different  reporting  rates,  different  climatological  probabilities 
and  different  number  of  DCP's. 

After  all  users  statistics  are  given,  the  program  outputs  the 
mean  reporting  rate  per  station  for  all  users  in  the  channel  (the 
"nation”)  and  the  variance  of  the  reporting  rate  of  the  nation. 

Finally,  design  criteria  is  output  for  all  users  and  the  nation. 

For  example,  at  the  local  level,  user  200  with  10  stations  satisfies 
the  unit  load  limitation  approximately 29%  of  the  time  (violates  it 
71%  of  the  time).  Nevertheless,  95%  reliability  (since  3  repetitions 
are  made)  is  achieved  nearly  100%  of  the  time.  Also  given  for  each  user 
is  the  maximum  deterministic  reporting  rate  allowed  to  be  within  the 
unit  load  requirement  100%  of  the  time.  Equation  4.11;  and  the  rate 
that  would  be  required  as  an  absolute  maximum  to  have  95%  or  more 
reliability  of  message  reception  100%  of  the  time.  Equation  4.12. 

Similar  information  is  given  for  the  national  level.  In  the  example 
95%  reliability  of  message  reception  is  achieved  nearlyioo%of  the  time 
at  the  national  level  . 
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A . 5  Computer  Equipment  and  Machine-Specific  Issues 


According  to  the  contract,  DCPMAIN  was  implemented  in  the  NED 
Water  Control  Branch  computer  facilities.  The  machine  used  was  a  Data 
General  N0VA3-12  with  64K  words  of  memory  and  two  disk  drives.  A 
large  number  of  instructions  appearing  in  the  program  are  unfortunately 
machine-specific.  They  deal  mostly  with  overlays  definitions  (required 
for  execution  in  such  a  small  machine)  and  in  all  input/output  related 
statements,  particularly  file  handling  instructions. 

The  overlay  structure  is  the  following.  The  main  program  addresses 
the  following  groups  of  subroutines: 


Subroutine 

Overlay 

Group  1 

INIT 

1 

Group  2 

DA TAIN 

2 

Group  3 

SEARCH 

3 

ALLOC 

4 

LOCAL 

5 

Group  4 

NATION 

6 

RCOVAR 

_ 

All  overlay  files  are  stored  in  a  machine-readable  file  DCPMAIN. OL. 
Following  is  a  summary,  by  subroutine,  of  the  main  types  of  machine- 
specific  statements. 
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DCPMAIN 


-  All  CALL  OVOPN 

-  All  CALL  OVLOD 

-  EXTERNAL  one,  two,  three,  four,  five,  six 

-  All  CALL  OPEN 

-  All  CALL  FOPEN 


-  All  TYPE  statements 

-  An  "x"  in  column  1  signals  the  compiler  to  only  load  these 
statements  if  requested  during  compilation 


INIT 

-  OVERLAY  ONE 

-  Initialization  of  unit  numbers  or  channels,  which  will  be 
different  for  other  machines 

SEARCH 

-  OVERLAY  THREE 

-  The  random  access  file  read  begins  in  record  zero.  This  may 
be  a  problem  on  other  machines. 

-  CALL  FSEEK 

-  TYPE  STATEMENTS 

DATA IN 

-  OVERLAY  TWO 

-  TYPE  STATEMENTS 
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ALLOC 


-  OVERLAY  FOUR 

-  TYPE  STATEMENTS 

-  CALL  FSEEK 


LOCAL 

-  OVERLAY  FIVE 

-  Calls  to  FSEEK 

-  TYPE  STATEMENTS 

NATION 

-  OVERLAY  SIX 

RCOVAR 

-  Calls  to  FSEEK 

-  TYPE  STATEMENTS 


A . 6  Changes  to  DCPMAIN 

Future  changes  to  DCPMAIN  will  probably  deal  with  changing  the 
number  of  precipitation  stations  affecting  a  user;  changing  the  number 
of  users;  or  changing  the  number  of  precipitation  stations  used  in 
obtaining  the  probabilistic  information.  The  following  sub-sections 
summarize  the  main  actions  to  take  if  the  above  changes  are  desired. 
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A. 6.1  Changing  the  Number  of  Precipitation  Stations  Affecting  a  User 


General  Changes  -  All  Subroutines/DCPMAIN 

-  Change  all  variables  in  COMMON  currently  dimensioned  to  25 
to  whatever  the  maximum  number  of  precipitation  stations  per 
user  is  desired. 

-  Change  all  DIMENSIONS  STATEMENTS  from  25  to  whatever  is  desired. 

-  Change  all  DO  LOOPS  currently  set  for  25  repetitions  to  whatever 
is  desired. 


Subroutine-Specific  Changes 
SUBROUTINE  SEARCH 

Change  statement: 

IREC  =  IREC  +  51  to 
IREC  =  IREC  +  I 

where  I  is  the  maximum  total  number  of  records  per  user  in  the 
User  Information  File,  F I  LI ,  including  the  user  identification  rubber. 

SUBROUTINE  ALLOC 

Change  statement: 

IRECE  =  IRECS  +  24  to 
IRECE  +  IRECS  +  J 

where  J  is  the  maximum  number  of  precipitation  stations  per 
user  minus  1 . 
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A. 6. 2  Changing  Number  of  Users 


General  Changes  -  All  Subroutines/DCPMAIN 

-  Change  all  variables  in  COMMON  currently  dimensioned  to  36  to 
whatever  maximum  number  of  users  is  desired. 

-  Change  all  DO  LOOPS  currently  set  for  36  repetitions  to  whatever 
maximum  number  of  users  is  desired.  These  occur  mostly  in  Sub¬ 
routine  IN  I T . 


A. 6. 3  Changing  Number  of  Precipitation  Stations  Used  in  Probabilistic 
Analysis 

In  order  to  change  the  number  of  precipitation  stations  in  the 
analysis  (i.e.,  stored  in  files  FIL1,  F2S1,2,3,4,  F3S1,2,3,4)  variable 
I CONI  in  Subroutine  INIT  must  be  reset  to  exactly  the  number  of  stations 
analyzed  by  program  ZAPPER. 


A.  7  The  Mechanics  of  Using  DCPMAIN  in  the  NED  Data  General  Computer 

All  subroutines  are  currently  stored  is  separate  files.  Therefore, 
after  editing  an  individual  subroutine  it  must  be  recompiled  by  typing: 

FORT  subroutine  file  name 

The  subroutine  file  names  are: 


File 

Subroutine 

DCPMAIN 

Main  Program 

DINIT 

IN  IT 

DATA IN 

DATA IN 

SEARCH 

SEARCH 

ALLOC 

ALLOC 

LOCAL 

LOCAL 

NATION 

NATION 

RCOVAR 

RCOVAR 

Due  to  the  overlay  structure,  the  entire  program  must  be  reloaded. 
This  is  done  by  typing: 

DCPLOAD 


or 


RLDR  DCPMAIN[DINIT,DATAIN, SEARCH  ALLOC  LOCAL, NATION  RCOVAR] 
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To  execute,  type: 


DCPMAIN 


Output  from  DCPMAIN  is  stored  on  disk  in  file  FIL5.  To  print 
results  on  your  terminal,  enter  the  following: 


TYPE  FIL5 


When  working  on  the  Tektronix,  printout  may  be  routed  to  the 
Line  Printer  as  follows: 


PRINT  FIL5 


Make  sure  to  position  page  before  hitting  return. 
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APPENDIX  B 


Preprocessing  Program 
Documentation 


B.l  Introduction 

This  Appendix  contains  program  listings  and  information  on  use  of  a 
series  of  programs  designed  to  provide  the  necessary  input  data  file, 
for  application  of  DCPMAIN,  as  described  in  Appendix  A.  Four  separate 
programs  were  developed  as  follows: 

1.  CAPPtR  -  designed  to  take  as  input  the  Corps  of  Engineers 
district  boundaries  and  generate  a  digitized  map  of  the 
conterminous  United  States  C  of  E  districts; 

2.  MAPPER  -  using  output  map  and  raingage  station  location  data, 
evaluates  Thiessen  areas  of  influence  for  all  gauging  stations 
and  all  districts  of  interest  producing  FIL1,  the  user 
information  file; 

3.  ZAPPER  -  developed  to  process  the  National  Weather  Service 
Techniques  Development  Laboratory  data  set  of  6-hour 
rainfall  amounts  for  255  gauging  stations,  and  generate 
statistics  on  numbers  on  rainfall  events  by  depth  interval  and 
number  of  events  of  joint  occurrence  of  rainfall  between  pairs 
of  stations; 


4.  TAPPER  -  takes  number  of  events  files  generated  and  converts 
to  probabilities  of  occurrence,  the  so-called  files  F2S1,  F2S2, 
F2S3,  F2S4,  F3S1 ,  F3S2,  F3S3,  and  F3S4  files  required  for 
use  by  DCPMAIN. 

These  preprocessing  programs  are  documented  in  the  sections  that 
follow,  but  the  purpose  is  not  to  provide  a  comprehensive  users  manual. 
The  programs  were  designed  to  analyze  and  generate  all  of  the  key 
statistics  required  by  DCPMAIN  regardless  of  the  manner  in  which  the  C 
of  E  districts  were  to  be  assigned  to  channels  for  random  reporting. 

The  data  needed  for  considering  of  the  existing  36  C  of  E  district  users 
for  the  conterminous  United  States  were  generated  in  the  course  of 
executing  these  preprocessing  modules.  Thus,  these  programs  were 
designed  to  be  executed  only  once  --  and  need  not  concern  the  user  of 
DCPMAIN,  only  the  output  files  from  these  models  which  have  been 
supplied  with  DCPMAIN,  are  required.  However,  should  a  change  be 
required  in  the  basic  input  used  in  preprocessing,  namely,  the  existing 
36  Corps  districts  and  the  use  of  the  255  station  NWSTDC  Data  set,  then 
the  preprocessing  modules  would  have  to  be  rerun. 
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u  *  2  program  CAPPER 


Program  CAPPER  written  in  standard  Fortran  IV  was  implemented  and 
executed  on  the  NED  Data  General  NOVA  3-12  discussed  in  Section  A. 5. 

The  program  listing  is  included  at  the  end  of  this  section,  followed  by 
file  BOUNDARY,  the  input  file  for  CAPPER. 

As  currently  configured,  the  program  is  designed  to  construct  a 
digitized  map  for  which  each  grid  point  is  associated  to  a  separate 
district.  The  key  controlling  parameters  initialized  in  the  program 
are: 

NLAT  -  the  number  of  grid  points  into  which  the  y-axis  of  the  map 
is  discretized  (rows); 

NLON  -  the  number  of  grid  points  into  which  the  x-axis  is 
discretized  (columns); 

NDIS  -  the  maximum  number  of  distinct  districts. 

As  executed,  the  map  grid  was  intended  to  represent  the  continental 
United  States  C  of  E  districts,  totaling  NDIS  =  36.  The  base  map  used 
co  develop  the  digitized  representation  was  the  December  197C  Division 
ario  District  Boundaries  tor  Civil  Works  Activities,  where  the  map  grid 
was  divided  into  DLAT  =  136  by  NLUN  =  224  points.  At  this  level  of 
discretization  each  grid  point  was  roughly  2G4  square  miles. 

The  reason  the  digitized  map  was  not  input  directly  was  that  the 
amount  of  input  information;  required,  namely  the  district  number 
associated  to  each  of  the  3U464  points  in  the  grid,  was  too  massive. 
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Instead  it  was  recognized  that  all  that  was  required  to  construct  the 
map  was  data  on  the  points  at  which  district  numbers  changed,  and  the 
nature  of  the  change,  in  each  imaginary  grid  row  through  the  map.  Using 
an  overlay,  this  information  was  developed  and  input  into  file  BOUNDARY. 
Each  record  in  the  file  contains: 

a)  the  number  of  the  row  of  interest  (1  to  NLAT); 

b)  the  number  of  the  district  to  which  the  map  changes,  in  the 
row  of  interest,  at  and  beyond  the  boundary  location;  and 

c)  the  column  location  of  the  first  point  of  the  associated 
district  ll  to  NLGN). 

The  program  assumes  that  district  36  is  assigned  to  all  locations  on  the 
map  outside  of  a  C  of  E  district  (i.e.,  in  oceans,  foreign  territories), 
and  that  ai strict.  99  is  a  flag  used  to  indicate  a  transition  to  the  next 
map  row. 

The  program  logic  is  simple.  It  loops  over  the  rows  of  the  map, 
reading  from  rile  BOUNDARY,  and  constructs  the  points  in  each  row  based 
on  the  input  boundary  information.  As  a  final  check,  the  map, 
transposed  from  the  input  orientation,  is  printed  on  the  line  printer. 
This  was  used  to  verify  the  input  boundary  information,  as  over  IbSO 
records  were  requireo  ns  the  BOUNDARY  file.  The  digitized  map  output  is 
shown  as  Figure  6.?..  This  output  information  was  written  to  a  random 
access  file  MAP  used  by  program  i'lAPPEk,  described  below,  to  compute 
Thitssen  areas. 
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d.3  Program  MAPPER 

Program  MAPPER,  written  in  standard  Fortran  IV  was  also  implemented 
and  executed  in  the  NED  Data  General  Nova  3-12.  The  program  listing  is 
included  at  the  end  of  this  section,  followed  by  file  COORD,  which  along 
with  MAP,  was  the  input  for  MAPPER.  Also  included  is  file  AREAS, 
containing  the  output  from  program  MAPPER. 

As  developed,  the  program  is  designed  to  use  the  digitized  map 
representation  of  districts,  and  coordinate  locations  of  rain  gauge 
stations  of  interest,  to  compute  the  Thiessen  areas  of  influence  of  the 
stations  on  each  of  the  C  of  E  districts.  The  other  key  controlling 
parameter,  in  addition  to  NLAT,  NLON,  and  NDIS  defined  to  characterize 
the  digitized  map,  is: 

NSTA  -  the  number  of  precipitation  measuring  stations.  As 
executed,  the  number  of  stations  was  set  as  NSTA  =  255,  representing  the 
stations  in  the  NWS  TDL  data  set. 

In  addition  to  the  MAP  file  generated  by  CAPPER,  the  other 
requisite  input  was  the  coordinates  of  the  255  rain  gauge  stations. 

Using  the  same  overlay  and  base  map  used  to  define  the  digitized  map, 
the  coordinates  of  the  rain  gauges  were  defined  using  the  136  by  224 
grid  system.  Each  record  of  the  COORD  file  contains: 

a)  Station  Number  (1  to  NSTA); 

b)  The  column  number  (longitude)  of  the  station  (ILON);  and 

c)  The  row  number  (latitude)  of  the  station  (HAT). 
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Stations  in  the  255  location  set  which  were  out  of  the  conterminous 
United  States  (i.e.,  those  in  Alaska,  Hawaii,  and  Puerto  Rico)  and 
station  numbers  75,  201,  and  218  (for  which  the  NWSTDL  data  was 
insufficient),  were  assigned  coordinate  locations  HAT  =  136,  ICON  =  0, 
corresponding  to  a  point  in  the  extreme  lower  left  corner  of  the  map. 
This  assignment  insured  that  data  from  these  station  would  not  have  any 
influence  on  the  36  C  of  E  districts. 

The  program  logic  is  relatively  straightforward.  For  each  of  the 
30,464  grid  points  in  the  map,  the  closest  of  the  255  precipitation 
stations  was  determined.  This  determination  was  made  based  on  the  known 
coordinates  of  the  grid  point  and  precipitation  station.  Also  known 
from  the  MAP  file  was  the  district  with  which  the  map  grid  point  was 
associated.  Using  this  information,  a  file  of  numbers  of  grid  points, 
by  district,  closest  to  each  of  the  255  rainfall  stations  is 
constructed.  These  data  are  then  normalized  by  the  total  number  of  grid 
points  (area)  in  each  district.  The  resulting  non-zero  Thiessen  areas 
of  influence,  and  associated  rainfall  station  numbers,  were  output  into 
file  AREAS.  This  file  was  constructed  so  that  the  first  record  contains 
the  C  of  E  district  number,  the  next  records  contain  non-zero  station 
numbers  and  associated  Thiessen  areas,  and  all  of  the  remaining  records 
until  record  51,  contain  zeros.  File  AREAS,  shown  at  the  end  of  this 
section,  with  minor  editing  for  consistency,  is  the  file  FIL1  required 
by  DCPMAIN. 
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B.4  Program  ZAPPER 


Program  ZAPPER  written  in  standard  Fortran  77  was  implemented  and 
executed  on  the  NED  Harris  facilities.  This  computer  system  is 
supported  by  the  Automatic  Data  Processing  (ADP)  Center  at  the  New 
England  Division  Headquarters,  and  by  21  other  C  of  E  groups.  The 
program  listing  is  included  at  the  end  of  this  section,  followed  by  a 
sample  of  the  input  file  DATAIN,  and  sample  output  files  ZAPPERF1  and 
ZAPPERF5. 

Program  ZAPPtR  is  designed  to  process  the  NWSTDL  rainfall  data  set 
to  evaluate  the  number  of  events  by  class  of  depth  interval,  and  the 
number  of  joint  occurrences  of  rainfall  or  no  rainfall  between  different 
stations,  for  all  stations  in  the  data  set.  The  basic  program  logic  is 
to  obtain  a  day's  rainfall  data,  evaluate  the  depth  class  interval  for 
each  station  of  interest  to  which  the  event  is  associated,  evaluate  the 
joint  occurrence  class  of  interest,  up-date  the  event's  counter,  and 
record  then  on  a  seasonal  basis.  The  key  controlling  parameters 
utilized  in  the  program  include: 

NSTA  =  the  number  of  rainfall  data  stations  in  the  data  set; 

NSEAS  =  The  number  of  seasons  of  interest;  and 

NDTS  =  the  number  of  depth  class  intervals. 

For  the  current  analysis,  data  was  available  for  NSTA  =  255  stations, 
and  this  was  evaluated  for  NSEAS  =  4  seasons  into  NDTS  =  8  depth  class 
intervals. 

By  way  of  brief  explanation,  this  program  was  originally  configured 
for  execution  on  the  C  of  E  Data  General  System.  Because  a  large 
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number  of  data  points  (255  stations  by  8  depth  classes  and  255  by  255 
stations  for  4  joint  daily  occurrence  classes  --  all  for  4  seasons)  were 
to  be  accounted,  and  because  the  Data  General  System  is  only  a  64K 
machine,  the  program  originally  kept  a  small  subset  of  the  number  of 
events  accumulators  in  core.  Thus,  in  order  to  execute  the  program, 
frequent  reads  from  and  writes  to  disc  were  required.  This  might  have 
been  acceptable  were  a  small  data  set  to  be  analyzed.  However,  the  9 
years  of  6-hourly  data  for  255  stations  meant  an  analysis  of  3287  days 
of  record,  or  nearly  1.2  million  data  points.  Experimental  runs 
indicated  that  hundreds  of  days  of  execution  time  would  have  been 
required  on  the  Data  General.  Accordingly,  the  program  was  moved  to  the 
faster  and  larger  Harris  System,  and  successfully  executed  on  this 
system  in  4.25  hours  of  CPU  time. 

The  program  requires  as  input  a  file  called  DATAIN  which  was 
constructed  to  contain  rainfall  records  on  a  6-hourly  basis  for  each  of 
255  stations.  The  first  record  of  each  group  in  the  file  contains  the 
rainfall  data  line,  namely: 

a)  year; 

b)  month; 

c)  day;  and 

d)  hour 

of  the  event  of  interest.  This  is  followed  by  rainfall  data  for  each  of 
the  255  stations  of  interest,  written  into  22  records.  The  file  was 
accessed  sequentially  for  the  9  years  of  records. 
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Alter  initializing  the  key  parameters,  numbers  of  events 
accumulators,  and  opening  the  input  and  output  files,  the  program  loops 
over  the  days  of  input  data  record.  The  6-hour  data  is  accumulated  into 
24-hour  data,  the  event  of  interest.  Then  for  each  station,  the  depth 
interval  of  interest,  namely; 


1. 

0.00  - 

0.01 

inches  ("no 

2. 

0.01  - 

0.50 

inches 

3. 

0.50  - 

1.00 

inches 

4. 

1.00  - 

1.25 

inches 

5. 

1.25  - 

1.50 

inches 

6. 

1.50  - 

2.00 

inches 

7. 

2.00  - 

3.00 

inches 

8. 

3.00+ 

inches 

is  determined.  The  number  of  events  by  interval  is  maintained  for  each 
station,  on  a  seasonal  basis.  These  data  are  recorded  in  the  ZAPPERF1 , 
ZAPPER  F2,  ZAPPERF3,  and  ZAPPERF4  for  the  four  seasons,  respectively. 
Seasons  are  defined  as  follows: 

1  =  December,  January,  February 

2  =  March,  April,  May 

3  =  June,  July,  August 

4  =  September,  October,  November- 

Then  for  each  station  i,  the  numbers  or  events  in  each  of  four  classes 
of  joint  occurrence  with  other  stations  j,  are  determined,  namely: 
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1. 


no  rainfall  at  station  i  and  no  rainfall  at  station  j; 

2.  no  rainfall  at  station  i  and  rainfall  at  station  j; 

3.  rainfall  at  station  i  and  no  rainfall  at  station  j;  and 

4.  rainfall  at  station  i  and  rainfall  at  station  j. 

This  provides  the  information  needed  to  evaluate  the  joint  probability 
of  occurrence  of  events  between  stations.  Because  of  the  symmetry 
involved,  if  the  numbers  of  events,  in  the  same  class,  between  some  pair 
of  stations  i  and  j  are  known,  it  is  not  necessary  to  evaluate  the 
events  for  stations  j  and  i,  the  same  pair  of  stations  in  reverse  order. 
Information  on  the  joint  occurrence  of  events  is  stored  in  files 
ZAPPERF5,  ZAPPERF6,  ZAPPERF7 ,  and  ZAPPERF8  for  the  four  seasons  1,  2,  3, 
and  4,  respectively.  As  the  program  steps  over  time,  only  the  current 
season  of  events  accumulations  is  kept  in  core. 

The  output  station  files  ZAPPERF1  to  ZAPPERF4  contain  in  each 
record  the  following  data: 

1.  station  number  (1  to  255) 

2.  season  number  (1  to  4) 

3.  number  of  events  for  each  of  the  depth  interval  classes. 

There  are  255  records  in  each  of  the  four  files.  A  sample  from  file 
ZAPPERF1  is  shown  at  the  end  of  this  section. 

The  output  cross  station  files  ZAPPEKF5  to  ZAPPERF8  contain  in  each 
record  the  following: 

1.  first  station  number  (I  ~  i  to  255) 

2.  second  station  number  (J  =  I  +  1  to  255) 
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3.  season  number  (1  to  4) 

4.  number  of  events  for  each  of  4  joint  occurrence  classes. 

There  are  32,385  records  in  each  of  the  four  files  (the  number  of  points 
in  the  upper  half  of  a  255  by  255  matrix). 

These  8  files  contain  numbers  of  events  and  are  generated  as  a 
intermediate  product.  The  information  of  interest  for  DCPMAIN  is  in 
fact  the  probabilities  of  events.  Program  TAPPER  converts  the  numbers 
of  events  into  probabilities  of  occurrence,  as  described  in  the  section 
that  follows. 

B.5  Program  TAPPER 

Program  TAPPER  written  in  standard  Fortran  77  was  implemented  and 
executed  on  the  N.E.D.  Harris  system,  described  in  the  previous  section. 
The  program  listing  is  included  at  the  end  of  this  section,  followed  by 
sample  output  from  files  TAPPERF1  and  TAPPERF5. 

Program  TAPPER  is  a  relatively  simple  program  designed  to  take  the 
output  files  from  program  ZAPPER  and  normalize  the  numbers  of  events  to 
probabilities  of  occurrence  of  events  of  interest. 

The  program  reads  each  of  the  station  files  ZAPPER1  to  ZAPPER4, 
determines  the  total  number  of  events  for  each  station  in  each  season, 
and  then  computes  and  writes  the  TAPPERF1  to  TAPPERF4  which  contain  the 
following  data  in  each  record: 

1.  station  number  (1  to  255) 

2.  season  number  (1  to  4) 
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2.  probability  of  no  rain 

4.  probability  of  rain  in  depth  interval  2  to  8  (see  Section  B.4), 
given  that  it  rained. 

These  files  TAPPERF1 ,  TAPPERF2,  TAPPERF3,  and  TAPPERF4  are  exactly  the 
input  files  F251 ,  F252,  F253,  and  F254  needed  as  input  to  DCPMAIN.  A 
sample  is  included  at  the  end  of  this  section.  Each  file  has  255 
records. 

In  an  analogous  manner,  ZAPPERF5  to  ZAPPERF8  were  read  in  and 
normalized  to  combine  in  each  record: 

1.  first  station  number  (1=1  to  255) 

2.  second  station  number  (J  =  I  +  1  to  255) 

3.  season  number  (1  to  4) 

4.  probability  of  events  in  each  of  4  joint  occurrence  classes. 
These  output  files,  TAPPERF5,  TAPPERF6,  TAPPERF7,  and  TAPPERF8  are 
exactly  the  input  files  F351,  F352,  F353,  and  F354  needed  as  input  to 
DCPMAIN.  A  sample  is  included  at  the  end  of  this  section.  Each  file 
has  32,385  records. 
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